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COVALENT JOINING OF DNA STRANDS TO RNA STRANDS 
CATALYZED BY VACCINIA TOPOISOMERASE 

5 This application claims the benefit of copending U.S. 

Provisional Application Serial No. 60/049,405, filed June 
12, 1997. 

This invention was made with support under Grant 
10 No. GM46330 from the National Institutes of Health, U.S. 

Department of Health and Human Services. Accordingly, the 
United States Government has certain rights in the 
invention. 

Throughout this application, various references are 
referred to within parentheses. Disclosures of these 
publications in their entireties are hereby incorporated by 
reference into this application to more fully describe the 
state of the art to which this invention pertains. Full 
bibliographic citations for these references may be found 
at the end of this application, preceding the sequence 
listing and claims. 

Background of the Invention 
25 Vaccinia topoisomerase binds duplex DNA and forms a 

covalent DNA- ( 3 ' -phosphotyrosyl ) -protein adduct at the 
sequence 5 1 -CCCTT 1 . The enzyme reacts readily with a 36-mer 
CCCTT strand (DNA-p-RNA) composed of DNA 5' and RNA 3' of 
the scissile bond. However, a 36-mer composed of RNA 5' and 
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DNA 3' of the scissile phosphate (RNA-p-DNA) is a poor- 
substrate for covalent adduct formation. Vaccinia 
topoisomerase efficiently transfers covalently held CCCTT- 
containing DNA to 5 1 -OH terminated RNA acceptors; the 
5 topoisomerase can therefore be used to tag the 5' end of 

RNA in vitro. 

Religation of the covalently bound CCCTT- containing DNA 
strand to a 5 1 -OH terminated DNA acceptor is efficient and 
rapid ( A' reI > 0.5 sec*") , provided that the acceptor DNA is 
capable of base-pairing to the noncleaved DNA strand of the 
topoisomerase -DNA donor complex. The rate of strand 
transfer to DNA ■ is not detectably affected by base 
mismatches at the 5' nucleotide of the acceptor strand. 
Nucleotide deletions and insertions at the ' 5 ' end of the 
acceptor slow the "rate of religation; . the observed 
hierarchy of reaction rates is: +1 insertion > -1 deletion 
> + 2 insertion >> -2 deletion. These findings underscore 
the importance of a properly positioned 5' OH terminus in 
transesterif icat ion reaction chemistry, but also raise the 
possibility that topoisomerase may generate mutations by 
sealing DNA molecules with mispaired or unpaired ends. 

Vaccinia topoisomerase, a 314-amino acid eukaryotic type I 
25 enzyme, binds and cleaves duplex DNA at a specific target 

sequence 5 ' - (T/C) CCTT 1 (1-3). Cleavage is a 

transesterif icat.ion reaction in which the Tp'N 
phosphodiester , is attacked by Tyr-274 of the enzyme, 

BNSDOClD:<WO 9856943A1 _IB> 
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resulting in the formation of a DNA- (3 ' -phosphotyrosyl ) 
protein adduct (4). The covalently bound topoisomerase 
catalyzes a variety of DNA strand transfer reactions. It 
can religate the. CCCTT-containing strand across the same 
bond originally cleaved (as occurs during the relaxation . of 
supercoiled DNA) or it can ligate the strand to a 
heterologous acceptor DNA 5' end, thereby creating a 
recombinant molecule (5-7) . 

Duplex DNA substrates containing a single CCCTT target site 
have been used to dissect the cleavage and strand transfer 
steps. A cleavage -religat ion equilibrium, is established 
when topoisomerase t ransesterif ies to DNA ligands 
containing £l8-bp of duplex DNA 3 * of the cleavage site (8- 
11) . The reaction is in equilibrium because the 5 ' -OH 
terminated distal segment of the scissile strand remains 
poised near the active site by virtue of the fact that it 
is stably base-paired with the nonscissile strand. About 
20% of the CCCTT- containing strand is covalently bound at 
equilibrium (11) . "Suicide" cleavage occurs when the CCCTT- 
containing substrate contains no more than fifteen base 
pairs 3' of the scissile bond, because the short leaving 
strand dissociates from the protein-DNA complex. In enzyme 
excess, >90% of the suicide substrate is cleaved (11) . 

. The suicide intermediate can transfer the incised CCCTT 
strand to a DNA acceptor. Intramolecular strand transfer 
occurs when the 5 1 -OH end of the noncleaved strand of the 
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suicide "intermediate attacks the 3 ' phosphotyrosyl bond and 
expels Tyr-274 as the leaving group. This results in 
formation of . a hairpin DNA loop (5) . Intermolecular 
religation occurs when the suicide intermediate is provided 
5 with an exogenous 5 ■ -OH terminated acceptor strand, the 

sequence of which is complementary to the single, strand 
tail of the noncleaved strand in the- immediate vicinity of 
the scissile phosphate (5) . In the absence of an acceptor 
strand, the topoisomerase can transfer- the CCCTT strand to 

10 water, releasing a 3 1 -phosphate- terminated hydrolysis 

produc-, or to glycerol, releasing a 3 ' -phosphoglycerol 
derivative (12). Although the hydrolysis and glycerololysis 
reactions are much slower than religation to a DNA acceptor 
strand, the extent of strand transfer to non-DNA 

15 nucleophiles can be as high as 15-40%. 

The specificity of vaccinia topoisomerase in DNA cleavage 
and its versatility in strand transfer have inspired 
topoisomerase-based strategies for polynucleotide synthesis 

20 in which DNA oligonucleotides containing CCCTT cleavage 

sites serve as activated linkers for the joining of other 
DNA molecules with compatible termini (13). The present 
study examines the ability of the vaccinia topoisomerase to 
cleave and re j oin ' RNA- containing polynucleotides. It was 

25 shown previously that the enzyme did not bind covalently to 

CCCTT- containing molecules in which either the scissile 
strand or the complementary strand was composed entirely of 
RNA (9) . To further explore the pentose sugar specificity 
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of the enzyme, we have prepared synthetic CCCTT-containing 
substrates in which the scissile strand is composed of DNA- 
and RNA-containing halves. In this way, we show that the 
enzyme is indifferent to RNA downstream of the scissile 
phosphate, but is does not form the covalent complex when 
the region 5* of the scissile phosphate is in >RNA form. 
Also assessed is the contribution of base -pair ing by the 5' 
end of the acceptor strand to the rate of the DNA strand 
transfer reaction. 

Summary of the Invention 

The present invention provides a method of covalently 
joining a DNA strand to an RNA strand comprising (a) 
forming a topoisomerase-DNA intermediate by incubating a 
DNA cleavage substrate comprising a topoisomerase cleavage 
site with a topoisomerase specific for that site, wherein 
the topoisomerase-DNA intermediate has one or more 5' 
single-strand tails; and (b) adding to the topoisomerase- 
DNA intermediate an acceptor RNA strand complementary to 
the 5' single-strand tail under conditions permitting a 
ligation of the 5' single-strand tail of the topoisomerase- 
DNA .intermediate to the RNA acceptor strand and 
dissociation of the topoisomerase, thereby covalently 
joining the DNA strand to the RNA strand. The DNA cleavage 
substrate may be created by hybridizing a DNA strand having 
a topoisomerase cleavage site to one or more complementary 
DNA strands , thereby forming a DNA cleavage substrate 
having a topoisomerase cleavage site and a oligonucleotide 
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leaving group located 3' of a scissile bond or may be a 
plasmid vector comprising a topoisomerase cleavage site. 

The present invention also provides a covalent 
topoisomerase -DNA intermediate having a 5' s ingle - strand 
tail. 

Another aspect of the present invention provides a DNA-RNA 
molecule covalently joined by topoisomerase catalysis. 

The present invention provides a covalently joined DNA-RNA 
molecule having a labeled 5' end. 



The present invention further- provides a method of tagging 
15 a 5' end of an RNA molecule comprising: (a) forming a 

topoisomerase -DNA intermediate by incubating a DNA cleavage 
substrate comprising a topoisomerase cleavage site with a 
topoisomerase specific for that site, wherein the 
topoisomerase -DNA intermediate has one or mor-.. 5* single- 
20 strand tails; and (b) adding to the topoisomerase-DNA 

intermediate a 5' -hydroxy 1 terminated RNA molecule 
complementary to the 5' single- strand tail under .conditions 
permitting a ligation of the covalently bound DNA strand of 
the topoisomerase-DNA intermediate to the RNA molecule and 
25 dissociation of the topoisomerase, thereby forming* a 5' end 

tagged DNA-RNA ligation product. The DNA cleavage 

substrate can be created, for example, by hybridizing a DNA 
strand having a topoisomerase cleavage site to a 
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complementary DNA strand, thereby forming ; a DNA cleavage 
substrate having a topoisomerase cleavage site and a 
oligonucleotide leaving group located 3 * of a scissile 
bond . 

5 

Another aspect of the present invention provides a 5 ' end 
tagged RNA molecule . 

In another aspect . the present invention also provides a 
10 DNA -RNA molecule which has been joined in vitro by the use 

of a topoisomerase. 

The present invention further provides a method of tagging 
a 5 1 end of a capped messenger RNA comprising: 

15 a) isolating niRNA from cells or a tissue; b) removing an 

RNA cap structure from the isolated mRNA, resulting' in a 
de-capped RNA; c) dephosphorylat ing the de-capped RNA, 
thereby forming a de-capped and dephosphorylat ed RNA; 
d) constructing a DNA cleavage substrate for topoisomerase 

20 having a topoisomerase cleavage site and a complementary 

strand, the complementary strand having a mixed or random 
base composition downstream of the topoisomerase cleavage 
site, the DNA cleavage substrate being designated as a DNA- 
(N) substrate; e) cleaving the DNA- (N) substrate with a 

25 topoisomerase, thereby forming a covalent topoisomerase- 

DNA - ( N ) M complex containing a 5 ! tail of mixed or random 
base composition on a nohcleaved strand; and f) incubating 
the cleaved covalent topoisomerase-DNA- (N)M complex with 
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the de-capped and dephosphorylated RNA formed in step (c) 
to form a 5* DNA-tagged DNA-RNA ligation product. 

As used herein the number of bases (N) of the DNA cleavage 
substrate, designated supra as a DNA- (N) substrate, may be - 
from one to four bases long. 

The present invention also provides a method of isolating 
and cloning a capped mRNA after subtraction of non-capped 
RNA comprising: a) isolating mRNA from cells or a tissue'; 
b) dephosphorylat ing the mRNA; c) incubating a cleaved 
topoisomerase -BioDNA- (N) complex with the dephosphorylated 
mRNA to form a 5' BioDNA- tagged DNA-RNA ligation product; 
d) removing the 5' BioDNA- tagged DNA-RNA ligation product 
and any unreacted cleaved topoisomerase-BioDNA- (N) complex 
by adsorption to streptavidin and recovering any 
nonadsorbed material, said material being enriched for RNA 
having a capped 5' end and being resistant to 
dephosphorylat ion in step (b) , thereby being unable to 
react with the cleaved topoisomerase-BioDNA- (N) complex; e) 
removing of the 5' end cap from the enriched RNA recovered 
from the nonadsorbed material in step (d) ; f ) 
dephosphorylating the de-capped RNA, thereby forming a de- 
capped and dephosphorylated RNA; g) incubating- a cleaved 
topoisomerase-BioDNA- (N) complex with the de-capped and 
dephosphorylated RNA to form a 5' BioDNA- tagged DNA-RNA 
ligation product; h) affinity purifying the 5'- DNA-tagged 
DNA-RNA ligation product; and i) PCR amplification of the 
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decapped and dephosphorylated RNA of the DNA-RNA ligation 
product using a sense primer corresponding to a scissile 
strand of the topoisomerase cleavage substrate 5 1 of the 
site of cleavage and an antisense primer, said antisense 
5 primer being complementary to either a 3' poly (A) tail or 

to an internal RNA sequence. 

The present invention also provides a method of obtaining 
full-length gene sequences comprising attaching a DNA tag 
10 to an isolated mRNA sequence and using the DNA-tagged mRNA 

as a template tox DNA synthesis. DNA may be further 
inserted into an expression vector and used to express 
recombinant protein . 

Brief Description, of the Figures 

Figure 1A-B. Topoisomerase cleavage of DNA- p- RNA and 

RNA - p - DNA strands. (A) The 36-bp substrate used in the 
cleavage reactions is shown, with the 3: P-labeled scissile 
phosphate indicated by the filled circle. The segments of 
the top strand flanking the scissile phosphate, which are 
either DNA or RNA, are bracketed ; the bottom strand is all- 
DNA. 1 (B) Reaction mixtures (20 \x\) containing 50 mM Tris- 
HC1 (pH 8.0), 0.2 pmol of substrate (either DNA- p- RNA or 
RNA-p-DNA) and topoisomerase as indicated were incubated at 
37°C for 10 min. Covalent adduct formation (% of input 
label transferred to the topoisomerase) is plotted as a 
function of the amount of enzyme added. 



20 
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Figure 2A-B. Kinetics of cleavage of RNA- containing 

36-mer substrates. Reaction, mixtures contained (per 20 /il) 
50 mM Tris-HCl (pH 8.0), 0.2 pmol of radiolabeled 36-mer 
substrate and 1 pmol of topoisomerase . Covalent adduct 
5 formation (% of input label transferred to ■ the 

topoisomerase) is plotted as a function of the time of 
incubation at 37°C. (A) Cleavage of DNA-p-DNA and DNA-p-RNA; 
x-axis in sec. (B) Cleavage of RNA-p-DNA; x-axis in min . 

10 Figure 3A-B. Strand transfer to an. RNA acceptor, 

(A) The structures of the covalent topoisomerase - DNA 
complex (suicide intermediate) and .the 18-mer acceptor 
strands (DNA or RNA) are shown. (B) Religation reactions 
were performed under single -turnover conditions as 

15 described under Materials and Methods. The extent of 

religation (expressed as the percent of input labeled DNA 
converted to the 30-mer strand transfer product) is plotted 
as a function of incubation time. 

20 Figure 4 . Analysis of the ■ strand transfer 

reaction products. Reaction mixtures (20 /xl) containing 50 
mM Tris-HCl (pH 8.0), 0.5 pmol of 5' -labeled suicide DNA 
cleavage substrate, and 2.5 pmol of topoisomerase were 
incubated at 37°C for 10 min. Strand transfer was then 

25 initiated by adding a 50 - fold . excess of the acceptor DNA 

(18-mer D; lanes 1 and 2) or acceptor RNA (18-mer R; lanes 
5 and 6), while simultaneously adjusting the mixtures to 
0.3 M NaCl . The religation reactions were quenched after a 
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10 min incubation by adding SDS to 0.2%. The samples were 
extracted with phenol/chloroform and ethanol -precipitated . 
The pellets were resuspended in either 12 fxl of 0 . 1M NaOH, 
1 mM EDTA (NaOH +) or 12 /zl of 10 mM Tris-HCl (pH 8.0), 1 
mM EDTA (NaOH -) . These samples were incubated at 37°C for 
16 h. Control samples containing the input 18-mer DNA 
substrate that had not been exposed to topoisomerase were 
treated in parallel (lanes 3 and 4) . The alkali - treated 
samples were neutralized by adding 1.2 /il of 1 M HCl . All 
samples were then ethanol-precipitated , resuspended in 
formamide, heated for 5 min at 95°C / and then 
electrophoresed through a 17% polyacrylamide gel containing 
7 M urea in TBE . An autoradiograph of the gel is shown. The 
positions of the' 30-mer religation product and the 18-mer 
input strand are indicated at the left. Alkaline hydrolysis 
of the RNA strand transfer reaction product (lane 6) 
yielded a discrete species denoted by the asterisk. 

Figure 5A-B. 5' DNA-tagging of RNA transcribed by T3 

RNA polymerase. (A) The structures of the covalent 
topoisomerase-DNA donor complex and the RNA acceptor are 
shown. The 5* single- strand tail of the suicide 
intermediate is complementary to the 18 nucleotides at the 
- 5' end of the T3 transcript. Reaction mixtures contained 
(per 15 fxl) 50 mM Tris-HCl (pH 8,0), 0.3 M NaCl , and 0.1 
pmol of " U P-GMP- labeled T3 transcript. (B) Religation was 
initiated by the addition of pre-formed topoisomerase-DNA 
donor (at a 10-fold molar excess over RNA acceptor). 
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Incubation was at 37°C. Aliquots (15 /zl) were removed at the 
times indicated and quenched immediately by adding SDS and 
EDTA. The samples were adjusted to 50% formamide, heated 
for 5 min at 95°C / and electrophoresed through a 12% 
polyacrylamide gel containing 7 M urea in TBE . Transfer of 
the 12 -nucleotide DNA donor strand to the 5' end of the 
labeled 3 6-mer T3 transcript yielded a labeled 48-mer 
product. Conversion of input 36-mer to 48-mer was 
quantitated by scanning the gel with a phosphorimager . 

Figure 6A-C. Kinetics of topoisomerase-catalyzed 

strand transfer reactions resulting in DNA deletions and 
insertions. (A) The structure of the pre-formed donor 
complex is shown at the. top of the Figure. Religation 
reactions were' performed under single - turnover conditions 
as described under Materials and Methods. All DNA acceptors 
were included at a 50 -fold molar excess over the input 
CCCTT- containing substrate. (B.) Deletion formation. The 
structures of the completely base-paired 18-mer acceptor 
DNA oligonucleotide (open circle) , a 17-mer oligonucleotide 
that anneals to the donor complex to leave a 1-nucleotide 
gap (filled square) and a 16-mer- strand that anneals to 
leave a 2-nucleotide gap (square) are shown. (C) Insertion 
formation. The structures of the completely base-paired 18- 
mer acceptor (open circle), a 19-mer oligonucleotide, 
containing 1 extra 5' nucleotide (filled triangle) and a 
20-mer acceptor containing 2 extra 5' nucleotides 
(triangle) are shown. The extent of religation" is plotted 
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as a function of incubation time. 

Figure 7. Analysis of deleted and inserted DNA 

strand transfer products. Religation to acceptors with 
5 recessed and protruding 5 1 ends was performed as described 

in the legend to Fig. 6. The reaction products were 
analyzed by electrophoresis through a 17% polyacrylamide 
gel containing 7 M urea in TBE . An autoradiograph of the 
gel is shown. The acceptor strands were as follows: no 

10 acceptor (lane 2); perfectly paired 18-mer (lanes 3 and 8); 

17-mer with a . 1 -nucleotide gap (lane 4); 16-mer with a 2- 
nucleotide gap (lane 5); 19-mer with a 1-nucleotide insert 
(lane 6) / 20-mer with a 2-nucleotide insert (lane 7) . 
Control samples containing the 5' -labeled 18-mer scissile 

15 strand but no topoisomerase were analyzed in lanes 1 and 9. 

Figure 8. 1 Strand transfer -to DNA acceptors 

containing a single 5' base mismatch. Religation reactions 
were performed under singi- - turnover conditions as 
20 described under Materials and Methods. All DNA acceptors 

were included at a 50-fold molar excess over the input 
CCCTT-containing substrate. The structures of the fully 
complementary 18-mer and the three terminal -nucleotide 
variants are shown. 



25 



Figure 9A-B. • Kinetics of intramolecular hairpin 

formation. (A) Hairpin formation without potential for 
base-pairing. DNA cleavage substrates' were prepared by 
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annealing the 5' J; P-labeled 18-mer scissile strand to a 30- 
mer complementary strand * (filled circle) or an 18-mer 
complementary strand (circle) ; the structures of the 
substrates are shown with the topoisomerase cleavage sites 
5 indicated by arrows. Reaction mixtures containing (per 20 

fil) 50 mM Tris HC1 (pH 7.5), 0.5 pmol of . DNA substrate, and 
1 pmol of topoisomerase were incubated at 37°C for 10 min. 
The mixtures were then adjusted to 0.3 M NaCl . Aliquots (20 
fxl) were withdrawn immediately prior- to adding salt (time 

10 zero) and at various intervals after adding salt; the 

reactions were quenched immediately by adding an equal 
volume of stop solution (1% SDS, 95% formamide, 20 mM 
EDTA) . The samples were heat -denatured and electrophoresed 
through a 17% polyacrylamide gel containing 7 M urea in 

15 TBE . The extent of intramolecular strand transfer 

(expressed as percent of the input labeled substrate 
converted to hairpin product) is plotted as a function of 
time after addition of NaCl . (B) Hairpin formation with 
potential for base-pairing. The structure of the 18-mer/30- 

20 mer cleavage substrate is shown, with the topoisomerase 

cleavage site indicated by an arrow. A reaction mixture 
containing (per 20 fil) 50 mM Tris HC1 (pH 7.5), 0.5 pmol of 
DNA substrate, and 1 pmol of topoisomerase was incubated at 
37°C for 2 min. The mixtures were then adjusted to 0.3 M 

25 NaCl. Aliquots (20 /il) were withdrawn immediately prior to 

adding salt (time zero) and at various intervals after 
adding salt. The extent of intramolecular strand transfer 
is plotted as a function of time after addition of NaCl. 
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Figure 10A-B., Affinity Tagging of RNA Using Vaccinia 

Topoisomerase . (A) The strand transfer reaction pathway is 
diagramed in the Figure. The biotinylated DNA substrate 
which contains a single topoisomerase recognition site is. 
immobilized on the Dynabeads (Dynal) streptavidin solid 
support. The biotin moiety (indicated by the black square) 
is introduced at the 5' end of the CCCTT-containing strand 
via standard protocols for automated oligonucleotide 
synthesis. The purified vaccinia topoisomerase is reacted 
with the bead-bound DNA to form a covalent enzyme -DNA donor 
complex, as illustrated. Enzyme not bound to DNA is 
removed by washing the beads with buffer. The strand 
transfer reaction is initiated by addition of the [ 32 P] -CMP 
labeled T7 transcript which is dephosphorylated by prior 
treatment with alkaline phosphatase. The 5' single - strand 
tail of the donor complex is complementary to the 12 
nucleotides at the 5' end of the T7 transcript. Religation 
of the covalently held biotinylated DNA strand to the T7 
transcript is observed as conversion of the 30-mer RNA to 
a product of 50 nucleotides. The mixture was incubated at 
37°C for 15 min. The beads were then recovered by 
centrif ugation, washed, and resuspended in 20 fil of buffer 
containing 0.8% SDS and 80% f ormamide . The samples were 
heated at 95°C for 5 min, centrifuged for 5 min, then the 
supernatants were electrophoresed through a 12% 
polyacrylamide gel containing 7M urea in TBE buffer. (B) 
An autoradiograph of the gel is shown in the Figure. Lane 
B (Bound) - product of the strand transfer reaction bound 
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to the Dynabeads ; lane F (Free) - supernatant from the 
strand transfer reaction. The positions of the input 30- 
mer T7 transcript and the 50-mer product are shown at ■ the 
right . 

5 . 

Figure 11. A schematic representation of a method 

of using DNA- tagged mRNA to obtain full-length gene 
sequences. Briefly, capped full-length mRNA is isolated by 
attachment to a solid support, such as by using 

10 biot inylated-capped mRNA bound to a magnetic bead 

conjugated with s t reptavidin . The isolated mRNA is 
decapped (using tobacco acid pyrophosphatase) and 
dephosphorylated (using alkaline phosphatase) then modified 
with a DNA tag using the methods outlined below. The DNA- 

15 tagged mRNA is used to generate first strand cDNA using 

reverse transcriptase and amplified using PCR . The 
amplified cDNA is then inserted into a plasmid vector. 

Detailed Description of the Invention 
20 Throughout this application, the following standard 

abbreviations are used to indicate specific nucleotides: 
C=cytosine A=adenosine U=uracil 

T=thymidine G=guanosine 

25 The present invention provides a method of covalently 

joining a DNA strand to an RNA strand comprising (a) 
forming a topoisomerase -DNA intermediate by incubating a 
DNA cleavage substrate comprising a topoisomerase cleavage 
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site with a topoisomerase specific for that site, wherein 
the topoisomerase-DNA intermediate has one or more 5' 
single-strand tails; and (b) adding to the topoisomerase- 
DNA intermediate ^ an acceptor RNA strand complementary to 
the 5' single-strand tail under conditions permitting a 
ligation of the covalently bound DNA strand of the 
topoisomerase-DNA intermediate to the RNA acceptor strand 
and dissociation of the topoisomerase, thereby covalently 
joining the DNA strand to the RNA strand. The DNA cleavage 
substrate may be created by hybridizing a DNA strand having 
a topoisomerase cleavage site to one or more complementary 
DNA strands, thereby forming a DNA" cleavage substrate 
having a topoisomerase cleavage site and a oligonucleotide 
leaving group located 3* of a scissile bond or may be a 
plasmid vector comprising a topoisomerase cleavage site. 

In an embodiment of the above-described method, the 
topoisomerase cleavage site is a sequence comprising CCCTT . 
In a preferred embodiment the topoisomerase is a vaccinia 
topoisomerase enzyme. In a further embodiment the vaccinia 
topoisomerase enzyme is a modified vaccinia topoisomerase 
enzyme. In another embodiment the DNA strand having a 
topoisomerase cleavage site is radiolabelled . In a 

preferred embodiment the radiolabel is 32 P or a 
radiohalogen . Means for radio labeling nucleotides are 
well known in the art (see Ausubel, et. al . , Short 
Protocols in Molecular Biology, 3rd ed . , Wiley, 1995/ US 
patent 5,746,997 issued 05/05/98). In another preferred 
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embodiment the DNA strand having a topoisomerase cleavage 
site is labeled with a biotin moiety or another affinity 
purification tag such as chitin binding domain, 
glutathione-S-transf erase , and the like. Methods of adding 
affinity labels to nucleotides are well known in the art 
(see Carniaci, et . al . , Genomics 37: 327-336,1996; Ausubel, 
et . al . , supra) . In an embodiment the topoi somerase - bound 
DNA intermediate and the acceptor RNA strand are ligated in 
vitro . 

The present invention provides a covalent topoi somerase - DNA 
intermediate molecule having a 5' single - strand tail. In 
an embodiment of the covalent topoisomerase -DNA 
intermediate molecule, the 5' single - strand tail comprises 
a specific sequence. In another embodiment the covalent 
topoisomerase-DNA intermediate molecule having -a 5* single- 
strand tail is generated by the above-described method of 
covalently joining a DNA strand to an RNA strand. In a 
further embodiment of the covalent topoisomerase-DNA 
-intermediate molecule having 5' single - strand tail 
generated by the above -described method of the 5' single- 
strand tail comprises a specific sequence. In another 
embodiment of the covalent topoisomerase-DNA intermediate 
molecule having a 5' single - strand tail generated by step 
(a) of the above -described method the DNA strand is 
radiolabelled . In a preferred embodiment of the covalent 
topoisomerase-DNA intermediate molecule the radiolabel is 
J/ "P or a radiohalogen . In another embodiment of the 
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covalent topoisomerase-DNA intermediate molecule having a 
5' single-strand tail generated by step (a) of the above- 
described method, the DNA strand is affinity labeled. In 
a preferred embodiment of the covalent topoisomerase-DNA 
5 intermediate molecule, wherein the affinity label is a 

biotin moiety, a chitin binding domain, a glutathione - S - 
transferase moiety, and the like. 

The present invention further provides a DNA-RNA molecule 
covalently joined by topoisomerase catalysis. 

10 The present invention provides a DNA-RNA molecule 

covalently joined by the above -described method of 
covalently joining a DNA strand to an RNA strand. In a 
preferred embodiment the covalently joined DNA-RNA molecule 
has a 5' end label. In a further embodiment the 5' end 

!5 label is 3 "P or a radiohalogen . In another embodiment the 

5' end label is a biotin moiety, a chitin binding domain, 
a glutathione-S-transf erase moiety, and the like. 

The present invention provides a covalently joined DNA-RNA 
molecule having a labeled 5' end. In a preferred 

20 embodiment of the covalently joined DNA-RNA molecule the 

5' end label is 32 P or a radiohalogen. In another preferred 
embodiment of the covalently joined DNA-RNA molecule the 
5' end label is a biotin moiety, a chitin binding domain, 
a glutathione-S -transferase moiety, and the like. 

25 The present invention further provides a method of tagging 

a 5' end of an RNA molecule comprising: (a) forming a 
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topoisomerase-DNA intermediate by incubating a DNA cleavage 
substrate comprising a topoisomerase cleavage site with a 
topoisomerase specific for that site, wherein the 
topoisomerase-DNA intermediate has one or more 5' single- 
5 strand tails; and (b) adding to the topoisomerase-DNA 

intermediate a 5" -hydroxy 1 terminated RNA molecule 
complementary to the 5' single-strand tail under conditions 
permitting a ligation of the 5 1 single - strand tail of the 
topoisomerase-DNA intermediate to the RNA molecule and 

10 dissociation of the topoisomerase, thereby forming a 5' end 

tagged DNA - RNA ligation product. The DNA cleavage 

substrate can be created, for example, by hybridizing a DNA 
strand having a topoisomerase cleavage site to a 
complementary DNA strand, thereby forming a DNA cleavage 

15 substrate having a topoisomerase cleavage site and a 

oligonucleotide leaving group located 3* of a scissile 
bond . 

The RNA molecule can be the product of in vitro synthesis 
.or can have been isolated from cells or tissues. Methods 

20 of synthesizing RNA in vitro are well known in the art 

(see, for example, Ausubel, et al, supra) . Methods of 
isolating RNA from .cells and/or tissues are also well known 
in the art (see, Ausubel, et al, supra) . Cells and tissues 
suitable for use in obtaining RNA useful in the practice of 

25 the present invention include both animal cells and plant 

cells. Particularly preferred cells include mammalian 
cells ('such as rodent cells, primate cells, and the like) 
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and insect cells. RNA may also be isolated from 

prokaryotic cells such as bacteria. 

In a preferred embodiment of the above-described method, 
the RNA molecule is a dephosphorylated after synthesis or 
isolation. In another preferred . embodiment the 

dephosphorylation is achieved by treatment of the RNA 
molecule with alkaline phosphatase. In a preferred 

embodiment the topoisomerase is a vaccinia topoisomerase 
enzyme. In another embodiment the vaccinia topoisomerase 
enzyme is a modified vaccinia topoisomerase enzyme. In a 
preferred embodiment the cleavage site comprises CCCTT . In 
another preferred embodiment the method further comprises 
introducing a biotin moiety or another affinity 
purification moiety, to the DNA cleavage substrate prior to 
step (a) . In still another preferred embodiment the method 
further comprises immobilizing the affinity purification 
tagged DNA cleavage substrate on a solid support prior to 
step (a) . In a preferred embodiment the solid support is 
a sepharose resin or magnetic beads having an affinity 
purification material, such as avidin, streptavidin , 
chitin, glutathione and the like, bound thereto. Methods 
of preparing such materials are well known in the art. In 
yet another preferred embodiment the method further 
comprises purifying a biotinylated 5' end tagged DNA - RNA 
ligation product by separating the solid support to which 
the biotinylated 5' end tagged DNA - RNA ligation product is 
immobilized from a liquid phase comprising unmodified RNA. 
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In a preferred embodiment the 5* end of the DNA cleavage 
substrate is affinity labeled. In a preferred embodiment 
the affinity label is a biotin moiety. In another 
preferred embodiment the method further comprises 
5 immobilizing the biotinylated 5' end affinity labeled DNA 

cleavage substrate on' a solid support. In a preferred 
embodiment the solid support is modified with s treptavidin . 
In another preferred embodiment the method further 
comprises purifying the biotinylated 5' end affinity 
10 labeled DNA-RNA ligation product by separating the 

s t reptavidin-modif ied solid support to which the 5' end 
tagged DNA-RNA ligation product is immobilized from a 
liquid phase comprising unmodified RNA. 

As used herein, unmodified RNA is defined as an RNA strand 
15 or strands which have not been joined covalently to a DNA 

strand . 

The present invention provides a 5' end tagged RNA 
molecule . In a preferred embodiment of the 5 ' end tagged 
RNA molecule, the tag is a DNA sequence. In a further 
20 preferred embodiment the 5 1 end tagged RNA molecule further 

comprising a 5' end label. In an embodiment the 5' end 
label is 3 ~P or a radiohalogen. In another embodiment the 
5' end label is a biotin, moiety or another affinity 
purification moiety. 

25 In an embodiment the 5' end tagging RNA molecule is 

generated by the above -described method of tagging a 5' end 
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of an RNA molecule.. In an embodiment the 5 * end tagged RNA 
molecule further comprises a 5' end label. In a further 
embodiment the 5' end label is 32 P . In another embodiment 
the 5' end label is a biotin moiety. 

In another aspect the present invention further provides a 
DNA-RNA molecule which has been joined in vitro by the use 
of a topoisomerase . 

As used herein the number of nucleotides (N) of the DNA 
cleavage substrate, designated supra as a DNA- (N) 
substrate, may be from one to four nucleotide ( s ) long. 



The present invention also provides a method of tagging a 
5' end of a capped messenger RNA comprising: a) isolating 
mRNA from cells or a tissue; b) removing an RNA cap' 
structure from the isolated mRNA , resulting in a de-capped 

15 RNA; c) dephosphorylating the de-capped RNA, thereby 

forming a de-capped and dephosphorylated RNA; d) 
constructing a DNA cleavage substrate for topoisomerase 
having a topoisomerase cleavage site and a complementary 
strand, the complementary strand having a mixed or random 

20 base composition downstream of the topoisomerase cleavage 

site, the DNA cleavage substrate being designated as a 
DNA- (N) substrate; e) cleaving the DNA- (N) substrate with 
a topoisomerase, thereby forming a covalent topoisomerase - 
DNA- (N) complex containing a 5' tail of mixed or random 

25 base composition on a noncleaved strand; and f) incubating 

the cleaved covalent topoisomerase-DNA- (N) complex with 
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the de-capped and dephosphorylated RNA formed in step (c) 
to form a 5' DNA-tagged DNA-RNA ligation product. 

In an embodiment of the above-described method, the removal 
of the RNA cap structure is by either of enzymatic 
5 treatment of the mRNA with a pyrophosphatase . or chemical 

decapping by periodate' oxidation and beta elimination. In 
a preferred embodiment the pyrophosphatase is tobacco acid 
pyrophosphatase. In another preferred embodiment the 
t opoisomerase cleavage site is CCCTT . In yet another 

10 preferred embodiment the DNA- (N) cleavage substrate has a 

biotin moiety upstream of the cleavage site and is 
designated BioDNA- (N) . In an embodiment the method further 
comprises affinity purification of the biotinylated 5' DNA- 
tagged DNA-RNA ligation product by a binding of the biotin 

15 moiety to streptavidin prior to step (e) . 

The present invention also provides a 5' tagged DNA-RNA 
ligation product generated by the method of tagging a 5' 
end of a capped messenger RNA. In an embodiment the 5' 
tagged DNA-RNA ligation product further comprises a 5* end 
20 label . In a further embodiment of the 5 ' end tagged DNA- 

RNA ligation product, the label is 3/ P. In another 
embodiment of the 5' end tagged DNA-RNA ligation product, 
the label is a biotin moiety. 

The present invention also provides a method of isolating 
25 and cloning a capped mRNA after subtraction of non-capped 

RNA comprising: a) isolating mRNA from cells or a tissue; 
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b) dephosphorylating the mRNA; c) incubating a cleaved 
topoisomerase-BioDNA- (N) complex with the dephosphorylated 
mRNA to form a 5' BioDNA- tagged DNA-RNA ligation product; 
d) removing the 5' BioDNA- tagged DNA-RNA ligation product 
and any unreacted cleaved topoisomerase -BioDNA- (N) complex 
by adsorption to streptavidin and recovering any 
nonadsorbed material, said material being enriched for RNA 
having a capped 5' end and being resistant to 
dephosphorylation in step (b) , thereby being unable to 
react with the cleaved topoisomerase -BioDNA- (N) complex; e) 
removing of the 5 ' end cap from the enriched RNA recovered 
from the nonadsorbed material in step (d) ; f ) 
dephosphorylating the de-capped RNA, thereby forming a de- 
capped and dephosphorylated RNA; g) incubating a cleaved 
topoisomerase-BioDNA- (N) complex with the de-capped and 
dephosphorylated RNA to form a 5' BioDNA- tagged DNA-RNA 
ligation product; h) affinity nurifying the 5' DNA-tagged 
DNA-RNA ligation product; and . PCR amplification of the 
decapped and dephosphorylated RNA of the DNA-RNA ligation 
product using a sense primer corresponding to a scissile 
strand of the topoisomerase cleavage substrate 5' of the 
site of cleavage and an antisense primer, said antisense 
primer being . complementary to either a 3' poly (A) tail or 
to an internal RNA sequence. In a preferred embodiment of 
the above-described method, the affinity purification in 
step (h) is by a binding of the 5 1 BioDNA- tagged DNA-RNA 
ligation product to streptavidin. In another preferred 
embodiment the removal of the RNA cap structure is by 
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either of enzymatic treatment of the mRNA with a 

pyrophosphatase or chemical. decapping by -periodate 
oxidation and beta el imination . In yet another preferred 

embodiment the pyrophosphatase is tobacco acid 
5 pyrophosphatase . 

In an embodiment of the method of covalently joining a DNA 
strand to an RNA strand, the 5* single strand tail has a 
specifically designed sequence. 

Another aspect of the present invention provides a method 
10 of targeting ligation of an RNA strand of interest within 

a mixture of RNA strands which comprises the above- 
described method of covalently joining a DNA strand to an 
RNA strand. In an embodiment of the method of targeting 
ligation of an RNA strand of interest within a mixture of 
15 RNA strands which comprises the method of covalently 

joining a DNA strand to an RNA strand, the 5' single strand 
tail provides specificity of a covalently joined DNA - RNA 
ligation product. 

In another preferred embodiment there is provided a method 
20 of obtaining a full-length gene sequence comprising: (a) 

isolating full-length mRNA; (b) attaching a DNA tag 
sequence to the isolated mRNA; and (c) synthesizing cDNA 
using the tagged mRNA as a template. 

To insure that only full- length -mRNA is used in this aspect 
25 of the invention (thus insuring the generation of a full- 

length gene sequence) it is -generally preferred that only 
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capped mRNA be isolated. Eukaryotic primary transcripts 
are modified at the initiating, or 5', nucleotide of the 
primary transcript by the addition of a 5 ' methylated cap 
(Shatkin, Cell 9:645, 1976) which, may serve to protect the 
5 mRNA from enzymatic degradation. Only full-lengch 

transcripts will be so modified. The cap structure may be 
modified, such as by adding an affinity purification tag 
such as biotin, chitin binding domain, and the like 
(Carnici, et al , supra). The affinity tagged capped mRNA 
10 can then be isolated from degraded mRNA or RNAs with poly 

A tails that are not full-length coding mRNAs . 

The affinity tagged mRNA can be separated from untagged RNA 
using affinity purification, for example by contacting the 
tagged mRNA with an affinity purification material such as 

15 a solid support complexed with streptavidin , avidin, 

chitin, glutathione, and the like. Alternatively, 
unmodified capped mRNA can be separated from RNA species 
lacking a cap by contacting the capped mRNA with a solid 
support complexed to, for example, pheny lboronic acid (see 

20 Theus and Liarakos, Biotechniques 9 < 5 ) : 610 -612 , 1990). 

Suitable' solid supports include various column 
chromatography gels, such as sepharose, agarose, and the 
like, and magnetic beads. 

Any eukaryotic cell type can serve as a source for mRNA to 
25 be used in practicing the method of the invention including 

both animal cells and plant cells. Suitable animal cells 
include mammalian cells (rodent, non-human primate, 
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primate, goat, sheep, cow, and the like) and insect cells 
(moth, Drosophila, and the like) . Methods of extracting 

mRNA from different cell types are well known in the art 
(see, for example, Ausubel, et al, supra) . 

The isolated mRNA is preferably decapped and 
dephosphorylated after isolation. Methods of decapping 
RNAs are well known in the art and include both enzymatic 
methods. (such as by using a pyrophosphatase such as tobacco 
pyrophosphatase) and chemical methods (such as periodate 
oxidation and beta elimination) . Likewise methods for 
dephosphorylation of RNA are well known in the art, for 
example by using alkaline phosphatase. 

A DNA tag sequence can be attached to the isolated full- 
length mRNA using the methods described above. A preferred 
DNA tag sequence is shown in Figure 11 both as a double 
stranded DNA cleavage substrate and as a covalent 
topoisomerase -DNA intermediate. The complementary strand 
of the topoisomerase-DNA intermediate includes a 3 ' 
overhang of from 1 to 4 nucleotides, which can be any 
•mixture of adenine, guanine, cytosine or thymine, 
designated in the figure as N . These nucleotides will base 
pair with the first 1 to 4 bases of the 5' end of the 
isolated mRNA molecule, allowing the covalently attached 
topoisomerase to catalyze the transesterif icat ion reaction 
which joins the DNA tag to the end of the RNA sequence . 
The DNA tag sequence comprises a topoisomerase recognition 
site, preferably CCCTT, and in addition may comprise- a 
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recognition site for a si te - specif ic restriction 
endonuclease, such as EcoRl , useful for the subsequent 
insertion of a cDNA molecule into an expression vector. 

The DNA-RNA molecule is used as a template for synthesis 
5 and amplification of full-length cDNA sequences, preferably 

using the polymerase chain reaction (PCR) , a technique well 
known in the art (see Ausubel, et al, supra) . Suitable 
primers include all or a portion of the 5' tag sequence of 
the DNA-RNA molecule and a gene specific 3 1 primer or an 
10 oligo dT primer. 

The amplified gene products are next isolated from the 
other components of the amplification reaction mixture. 
This purification can be accomplished using a variety of 
methodologies such as column chromatography, gel 

15 electrophoresis, and the like. A preferred method of 

purification utilizes low-melt agarose gel electrophoresis. 
The reaction mixture is separated and visualized by 
suitable means, such as ethidium bromide staining. DNA 
bands that represent correctly sized amplification products 

20 are cut away from the rest of the gel and placed into 

appropriate corresponding wells of a 96-well microtiter 
plate. These plugs are subsequently melted and the DNA 
contained therein utilized' as cloning inserts. 

The purified, amplified gene sequences are next inserted 
25 into an expression vector. A variety of expression vectors 

are suitable for use in the practice of the present 
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invention, both for prokaryotic expression and; eukaryotic 
expression. In general, the expression vector will have 
one or more of the following features: a promoter-enhancer 
sequence, a selection marker sequence, an origin of 
replication, an affinity purification tag sequence, an 
inducible element sequence, an epitope-tag sequence, and 
the like . 

Promoter-enhancer sequences are DNA sequences to which RNA 
polymerase binds and initiates transcription. The promoter 
determines the polarity of the transcript by specifying 
which strand will be transcribed. Bacterial promoters 
consist of consensus- sequences, -35 and -10 nucleotides 
relative to the transcriptional start, which are bound by 
a specific sigma factor and RNA polymerase. Eukaryotic 
promoters are more complex. Most promoters utilized in 
expression vectors are transcribed by RNA polymerase II. 
General transcription factors (GTFs) first bind specific 
sequences near the start and then recruit the binding of 
RNA polymerase II. In addition to these minimal promoter 
elements, small sequence elements are recognized 
specif ically by modular DNA -binding/ trans -act ivat ing 
proteins (eg. AP-1, SP-1) which regulate the activity of a 
given promoter. Viral promoters serve the same function as 
bacterial or eukaryotic promoters and either provide a 
specific RNA polymerase in trans (bacteriophage T7) or 
recruit cellular factors and RNA polymerase (SV40, RSV, 
CMV) . Viral promoters are preferred as they are generally 
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particularly strong promoters. 

Promoters may be, furthermore, either constitutive or, more 
preferably, regulatable (i.e., inducible or derepressible) . 
Inducible elements are DNA sequence elements which act in 
5 conjunction with promoters and bind either repressors (eg. 

lacO/LAC Iq repressor system in E. coli) or inducers (eg. 
gall / GAL4 inducer system in yeast). In either case, 
transcription is virtually "shut off" until the promoter is 
derepressed or induced, at which point transcription is 
10 "turned-on". 

Examples of constitutive promoters include the int promoter 
of bacteriophage X, the bla promoter of the p-lactamase gene 
sequence of pBR322, the CAT promoter' of the chloramphenicol 
acetyl transferase gene sequence of pPR325, and the like. 

15 Examples of inducible prokaryotic promoters include the 

major right and left promoters of bacteriophage ( P L and P ? } , 
the trp, reca, lacZ, Lad, AraC and gal promoters of E. 
coli, the a-amylase (Ulmanen Ett at., J. Bacteriol . 
162:176-182, 1985) and the sigma-28-speci f ic promoters of 

20 B. subtilis (Gilman et al . , Gene sequence 32:11-20(1984)), 

the promoters of the bacteriophages of Bacillus (Gryczan, 
In: The Molecular Biology of the Bacilli, Academic Press, 
Inc., NY (1982)), Streptomyces promoters (Ward et at., Mol . 
Gen. Genet. 203:468-478, 1986), and the like. Exemplary 

25 prokaryotic promoters are reviewed by Glick (J. Ind. 

Microtiot. 1:277-282, 1987 ) ; . Cenatiempo (Biochimie 68:505- 
516, 1986); and Gottesman (Ann. Rev. Genet. 18:415-442, 
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1984) . 

Preferred eukaryotic promoters include, for example, the 
promoter of the mouse metal lothionein I gene sequence 
(Hamer et'al,, J. Mol . Appl . Gen. 1:273-288, 1982); the 
5 TK promoter of Herpes virus (McKnight, Cell 31:355-365, 

1982); the SV40 early promoter (Benoist et al . , Nature 
(London) 290:304-310, 1981); the yeast gall gene sequence 
promoter (Johnston et al., Proc . Natl. Acad. Sci. (USA) 
79:6971-6975, 1982); Silver et al . , Proc. Natl. Acad. 
10 Sci. (USA) 81:5951-5955, 1984), the CMV promoter, the EF-i 

promoter, Ecdysone-responsive promoter (s), and the like. 

Selection marker sequences are valuable elements in 
expression vectors as they provide a means to select, for 
growth, only those cells which contain a vector. Such 

15 markers are of two types: drug resistance and auxotrophic. 

A drug resistance marker enables cells to detoxify an 
exogenously added drug that would otherwise kill the cell. 
Auxotrophic markers allow cells to synthesize an essential 
component (usually an amino acid) while grown in media 

20 which lacks that essential component. 

Common selectable marker gene sequences include those for 
resistance to antibiotics such as ampiciilin, tetracycline, 
kannamycin, bleomycin, streptomycin, hygromycin, neomycin, 
Zeocin™, and the like. Selectable auxotrophic gene 
25 sequences include, for example, hisD, which allows growth 

in histidine free media in the presence of histidinol. 
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A preferred selectable marker sequence for use in yeast 
expression systems is URA3 . Laboratory yeast strains 
carrying mutations in the gene which encodes orotidine-5' - 
phosphate decarboxylase, an enzyme essential for uracil 
5 biosynthesis, are unable to grow in the absence of 

exogenous uracil. A copy of the wild-type gene (ura4+ in 
S. pombe and URA3 in S. cerevisiae) will complement this 
defect in trans . 

A further element useful in an expression vector is an 
1° origin of replication sequence. Replication origins are 

unique DNA segments that contain multiple short repeated 
sequences that are recognized by multimeric origin-binding 
proteins and which play a key role in assembling DNA 
replication enzymes at the origin site. Suitable origins 
!5 of replication for use in expression vectors employed 

herein include E. coli oriC, 2/u and ARS (both useful in 
yeast systems), sfl, SV40 (useful in mammalian systems) , 
and the like. 

Additional elements that can be included in an expression 
20 vector employed in accordance with the present invention 

are sequences encoding affinity purification tags or 
epitope tags. Affinity purification tags are generally 
peptide sequences that can interact with a binding partner 
immobilized on a solid support. Synthetic DNA sequences 
25 encoding multiple consecutive single amino acids, such as 

histidine, when fused to the expressed protein, may be used 
for one-step purification of the recombinant protein by 
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high affinity binding to a . resin column, such as nickel 
sepharose. An endopept idase recognition sequence can be 
engineered between the polyamino acid tag and the protein 
of interest to allow subsequent removal of the leader 
peptide by digestion with Enterokinase, • and other 
proteases. Sequences encoding peptides such as the chitin 
binding domain (which binds to chitin) , glutathione-S- 
transferase (which binds to glutathione), biotin (which 
binds to avidin and strepavidin) , and the like can also be 
used for facilitating purification of the protein of 
interest. The affinity purification tag can be separated 
from the protein of interest by methods well known in the 
art, including the use of inteins (protein sel f-spl icing 
elements, Chong, et al, Gene 192:271-281, 1997). 

Epitope tags are short peptide sequences that are 
recognized by epitope specific antibodies. A fusion 
protein comprising a recombinant protein and an epitope tag 
can be simply and easily purified using an antibody bound 
to a chromatography resin. The presence of the epitope tag 
furthermore allows the recombinant protein to be detected 
in subsequent assays, such as Western blots, without having 
to produce an antibody specific for the recombinant protein 
itself. Examples of commonly used epitope tags include V5, 
glutathione-S-transf erase (GST) , hemaglutinin (HA) , the 
peptide Phe-His-His-Thr-Thr , chitin binding domain, and the 
like. 
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multiple cloning site or polylinker. Synthetic DNA 
encoding a series of restriction endonuclease recognition 
sites is inserted into a plasmid vector downstream of the 
promoter element. These sites are engineered for 

convenient cloning of DNA into the vector at a specific 
position. 

The foregoing elements can be combined to produce 
expression vectors useful in creating the libraries of the 
invention. Suitable prokaryotic vectors include plasmids 
such as those capable of replication in E. coil (for 
example, pBR322, ColEl, pSClOl, PACYC 134, itVX, pRSET, 
pBAD (Invitrogen, Carlsbad, CA) and the like) . Such 
plasmids are disclosed by Sambrook (cf. "Molecular 
Cloning: A Laboratory Manual", second edition, edited by 
Sambrook, Fritsch, & Maniatis, Cold Spring Harbor 
Laboratory, (1989)). Bacillus plasmids include pC194, 
pC221, pT127, and the like, and are disclosed by Gryczan 
(In: The Molecular Biology of the Bacilli, Academic Press, 
NY (1982), pp. 307-329). Suitable Streptomyces plasmids 
include plJlOl (Kendall et, al . , J. Bacterid. 169:4177- 
4183,1987), and streptomyces bacteriophages such as <J)C31 
(Chater et al . , In: Sixth International Symposium on 
Actinomycetales Biology, Akademiai Kaido, Budapest, Hungary 
(1986), pp. 45-54). Pseudomonas plasmids are reviewed by 
John et al. (Rev. Infect. Dis. 8:693-704, 1986), and 
Izaki (Jpn. J. Bacterid. 33:729-742, 1978). 

Suitable eukaryotic plasmids include, for example, BPV, 
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vaccmia, SV40,. 2-micron circle, pcDNA3.1, pcDNA3 . 1 / GS / 
pYES2/GS, pMT, p IND, pIND(Spl) , pVgRXR (Invitrogen) , and 
the like, or their derivatives. Such plasmids are well 
known in the art (Botstein et al., Miami Wntr. Symp. 
19:265-274, 1982; Broach, In: "The Molecular Biology of the 
Yeast Saccharomyces : Life Cycle and Inheritance'', Cold 
Spring Harbor Laboratory, Cold Spring Harbor, NY, p. 445- 
470, 1981; Broach, Cell 28:203-204, 1932; Dilon et at., J. 
Clin. Hematol. Oncol. 10:39-48, 1930; Maniatis, In: Cell 
Biology: A Comprehensive Treatise, Vol. 3, Gene Sequence 
Expression, Academic Press, NY, pp. 563-608,1980. 

Once plasmids containing the gene sequence insert in the 
correct orientation have been identified, plasmid DNA is 
. prepared for use in the transformation of host cells for 
expression. Methods of preparing plasmid DNA and 

transformation of cells are well known to those skilled in 
the art. Such methods are described, for example, in 
Ausubei, et al, supra. 

Prokaryotic hosts are, generally, very efficient and 
convenient for the production of recombinant proteins and 
are, therefore, one type of preferred expression system. 
Prokaryotes most frequently are represented by various 
strains of E. coli. However, . other organisms may also be 
used, including other bacterial strains. 

Recognized prokaryotic hosts include bacteria such as E. 
coli and those from genera such as Bacillus, S treptomyces , 
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Pseudomonas, Salmonella, Serratia, and the like. However, 
under such conditions, the polypeptide will not be 
glycosylated. The prokaryotic host selected for use herein 
must be compatible with the replicon and control sequences 
5 in the expression plasmid. 

Suitable hosts may often include eukaryotic cells. 
Preferred eukaryotic hosts include, for example, yeast, 
fungi, insect cells, and mammalian cells either in vivo, or 
in tissue culture. Mammalian cells which may be useful as 

10 hosts include HeLa cells, cells of fibroblast origin such 

as VERO, 3T3 or.CHOKl, HEK 293 cells or cells of lymphoid 
origin (such as 32D cells) and their derivatives. 
Preferred mammalian host cells include * nonadherent cells 
such as CHO, 32D, and the like. Preferred yeast host cells 

15 include S, pombe, Pichia pastoris, S. cerevisiae (such as 

INVScl), and the- like. 

In addition, plant cells are also available as hosts, ana 
control sequences compatible with plant cells are 
available, such as the cauliflower mosaic virus 35S and 

20 19S, nopaline synthase promoter and polyadenyla t ion signal 

sequences, and the like. Another preferred host is an 
insect cell, for example the Drosophila larvae. Using 
insect cells as hosts, the Drosophila alcohol dehydrogenase 
promoter can be used. Rubin, Science 24 0:14 53-14 59, 1988). 

25 Alternatively, baculovirus vectors can be engineered to 

express large amounts of peptide encoded by a desire gene 
sequence in insects cells (Jasny, Science 238:1653, 1987); 
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Miller et al., In: Genetic Engineering (1986), Setlow, 
J.K., ez al., eds M Plenum, Vol. 8, pp. 277-297) . The 
present invention also features the purified, isolated or 
enriched versions of the expressed gene products produced 
by the methods described above. 

This invention will be better understood from the 
Experimental Details which follow. However, one skilled in 
the art will readily appreciate that the specific methods 
and results discussed are merely illustrative of the 
invention as described more fully in the claims which 
follow thereafter. 

Experimental Details 

METHODS AND MATERIALS 

Preparation of Tandem RNA-p-DNA and DNA- p- RNA 
Oligonucleotides . 

CCCTT-containing 3.6-mer oligonucleotides containing a 
single internal --P-label at the scissile phosphate .were 
prepared by ligating two 18-mer strands (synthetic RNA or 
DNA oligonucleotides) that had been hybridized to a 
complementary 36-mer DNA strand. The sequence of the 
proximal CCCTT-containing 18-mer strand was 5'- 
CATATCCGTGTCGCCCTT as DNA or 5' -CAUAUCCGUGUCCCUU as RNA. 
The sequence of the dxstal 18-mer strand was 5' 
AT T C C G A.T AG T G AC T AC A as DNA or 5' -AUUCCGAUAGUGACUACA as RNA. 
The distal 18-mer strand was 5' -labeled in the presence of 
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[Y 32 P] ATP and T4 polynucleotide kinase, then gel-purified. 
The sequence of the 36-mer str.and was 5' 
TGTAGTCACTATCGGAATAAGGGCGACACGGATATG . The strands were 
annealed in 0.2 M NaCl by heating at 65°C for 2 min, 
5 followed by slow-cooling to room temperature. The molar 

ratio of the 5 f -labeled distal 18-mer to the proximal 18- 
mer and the 36-mer strand in the hybridization mixture was 
1:4:4. The singly nicked product of the annealing reaction 
was sealed in vitro with purified recombinant vaccinia 

10 virus DNA ligase (14, 15). The ligation reaction mixtures 

(400 ul) contained 50 mM Tris HC1 (pH 8.0), 5 mM DTT 10 mM 
MnCl 2 , 1 mM ATP, 10 pmol of 5' 32 p-labeled nicked substrate, 
and 160 pmol of ligase. After incubation for 4 h at 22°C, 
the reactions were halted by the addition of EDTA to a 

15 final concentration of 25 mM . The samples were extracted 

with phenol-chloroform and the labeled nucleic acid was 
recovered from the aqueous phase by ethanol precipitation. 
The 36-mer duplex products were dissolved in TE buffer (10 
mM tris HC1, pH 8.0, 1 mM EDTA). Ligation of the labeled 

20 . 18-mer distal strand to the unlabeled CCCTT-containing 18- 
mer strand to form an internally labeled 36-mer product was 
confirmed by electrophoresis of the reaction products 
through a 17% denaturing polyacrylamide. gel. The extends 
of ligation [ 36-mer/ ( 36-mer + 18-mer)] were as follows: 

25 DNA- p- DNA (88%); DNA-p-RNA (67%); RNA-p-DNA (66%). 

Covalent Binding of Topoisomerase to Internally Labeled 36- 
mer duolexes . 
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Recombinant . vaccinia topoisomerase was expressed in 
bacteria and purified via phosphocellulose and SP5PW column 
chromatography as described (16, 17). Reaction mixtures 
for assay of covalent adduct formation contained (per 20 
jjl) 50 mM Tris-HCl (pH 8.0), 0.2 pmol of 36-mer duplex, and 
1 pmol of topoisomerase. The reactions were initiated by 
adding topoisomerase and halted by adding SDS to 1% final 
concentration. the samples were analyzed by SDS-PAGE. 
Covalent complex formation was revealed by the transfer of 
radiolabeled polynucleotide to the topoisomerase- 
polypeptide (3). The extent of adduct formation was 
quantitated by scanning the gel using a FUJIX BAS1000 
phosphorimager and was expressed as the percent of the 
input 5' ^p-labeled 36-mer substrate that was covalently 
transferred to protein. 

DNA Strand Transfer to an RNA Acceptor . 

An 13-mer CCCTT-containing DNA oligonucleotide (5' 
CGTGTCGCCCTTATTCCC) was 5' end-labeled in the presence of 
[ y 32 P j ATP and T4 polynucleotide kinase, then gel-purified 
and hybridized to a complementary 30-mer strand to form the 
18-mer/30-mer suicide cleavage substrate. Covalent 
topoisomerase-DNA complexes were formed in a reaction 
mixture containing (per 20 ul) 50 mM Tris-HCl (pH 8.0), 0.5 
pmol of 18-mer/30-mer DNA, and 2.5 pmol of topoisomerase. 
The mixture was incubated for 5 min at 37°C. The strand 
transfer reaction was initiated by adding an 13-mer 
acceptor strand 5' - AT T CCG AT AG TG AC T AC A (either DNA or RNA) 
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to a concentration' of 25 pmol/20 pi (i.e., a 50-fold molar 
excess over the input DNA substrate) , while simultaneously 
adjusting the reaction mixtures to 0.3 M NaCl . The 
reactions were halted by addition of SDS and formamide to 
5 0.2% - and 50%, respectively. The samples were heat- 

denatured and then electrophoresed through a 17% 
poiyacrylamide containing 7 M urea in TBE (90 mM Tris- 
borate, 2.5 mM EDTA) . The extent of strand transfer 
(expressed as the percent of input labeled DNA converted to 
10 a 30-mer strand transfer product) was quantitaied by- 

scanning the wet gel with a phosphor imager . 

Preparation of 32 p-labeled 36-mer RNA . 

A 36-nucleotide run-off transcript was synthesized in vitro 
by T3 RNA polymerase from a pBluescript II-SK(-) plasmid 

15 template that had been linearized by digestion with 

endonuclease Eagl . A transcription reaction mixture (100 
pl> containing 40 mM Tris HC1 (pH 8.0), 6 mM MgCl-, 2 mM 
spermidine, 10 mM NaCl, 10 mM DTT, 0.5 mM ATP, 0 . 5 mM CTP, 
0.5 mM UTP, 6.25 pM [a 32 P] GTP, 5 pg of template DNA, and 

20 100 units of T3 RNA polymerase (Promega) was incubated for 

90 min at 37°C. The reaction was halted by adjusting the 
mixture to 0.1% SDS, 10 mM EDTA, and 0.5 M ammonium 
acetate. The samples were extracted with phenol-chloroform 
and ethanol-precipitated . The pellet was resuspended in 

25 formamide and electrophoresed through a 12% poiyacrylamide 

gel containing 7M urea in TBE. The radiolabeled 3 6-mer RNA 
was localized by autoradiography of the wet gel and eluted 
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■ from an excised gel slice by soaking for 16 h at 4°C in 0.4 
ml of buffer containing 1 M ammonium acetate, 0.2% SDS, and 
20 mM EDTA . The eluate was phenol-extracted and ethanol- 
precipitated . The RNA was resuspended in TE . 

5 Dephosphorylation of the RNA 5' terminus was carried out in 

a reaction mixture (30 ul) containing 10 mM Tris HC1 (pH 
7.9), 50 mM NaCl, 10 mM MgCl 2 , 1 mM DTT , 10 pmol of 36-mer 
RNA, and 30 units of calf intestine alkaline phosphatase 
(New England Biolabs) . After a 1 h incubation at 37°C, the 
10 mixture was phenol-extracted and ethanol-precipitated . The 

phospha tase-treated 36-mer transcript was., repurified 
elec trophoret ically as described above. 

Affinity Tagging of RNA Using Vaccinia Topoisomerase 

The strand transfer reaction pathway is diagrammed in 
Figure 10a. The biotinyiated DNA Substrate which contains 
a single topoisomerase recognition site is immobilized on 
the Dynabeads (Dynal) streptavidin solid support. The 
biotin moiety (indicated by the black square) is introduced 
at the 5' end of the CCCTT-containing strand via standard 
protocols for automated oligonucleotide synthesis. The 
purified vaccinia topoisomerase is reacted with the bead- 
bound DNA to form a covalent ' enzyme-DNA donor complex, as 
illustrated. Enzyme not bound to DNA is removed by washing 
the beads with buffer. The strand transfer reaction is 
initiated by addition of the [ 32 P]-CMP labeled T7 transcript 
which is dephosphorylated by prior treatment with alkaline 
phosphatase. The 5' single-strand tail of the donor 
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complex is complementary to the 12 nucleotides at the 5' 
end of the T7 transcript. Religation of the covalently 
held biotinylated DNA strand to the T7 transcript is 
observed as conversion of the 30-mer RNA to a product of 50 
5 nucleotides. 

Experimental Details : The DNA substrate was formed by 
annealing the biotinylated 25-mer strand containing the 
topoisomerase recognition site to a complementary 5' 
phosphorylated 24-mer strand (present at a 4-fold molar 

10 excess). The strands were annealed" in the presence of 0.2 

M NaCl by heating at 65 C C for 10 min, followed by slow 
cooling to room temperature. The biotinylated duplex was 
immobilized on streptavidin beads by incubating 10 pmol of 
the DNA with 10 pg of Dynabeads in 50 mM Tris-HCl (pH 8.0), 

15 1 M NaCl for 10 min at 22°C. The beads were recovered by 

centrif ugation . The beads were rinsed twice with 1 ml of 
50 mM Tris-HCl (pH 8.0) . The washed beads were resuspended 
in 20 pi of 50 mM Tris-HCl (pH 8.0) . A 5-fold molar excess 
of topoisomerase (50 pmol) was added to the bead- linked DNA 

20 substrate. The mixture was incubated at 37 °C for min. The 

beads were recovered by centrif ugation, rinsed twice with 
1 ml of 50 mM Tris-HCl, then resuspended in 18 pi of 50 mM 
Tris-HCl, 0.3 M NaCl. Strand transfer was initiated by 
addition of 1 pmol of [ 3: P]-CMP labeled T7 transcript. The 

25 mixture was incubated at 37°c for 15 min. The beads were 

then recovered by centrif ugation, washed, and resuspended 
in 20 pi of buffer containing 0.8% SDS and 80% formamide. 
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The samples were heated at 95°C for 5 rain, centrifuged for 
5 rain, then the supernatants were elect rophoresed through 
a 12% polyacrylamide gel containing 7M urea in TBE buffer. 
An autoradiograph of the gel is shown in Figure 10B. Lane 
5 B (Bound) - product of the strand transfer reaction bound 

to the Dynabeads; lane F (Free) - supernatant from the 
strand transfer reaction. The positions of the input 30- 
mer T7 transcript and the 50-mer product are shown at the 
right . 

10 RNA substrate : The 30-nucleot ide runoff transcript was 

synthesized in vitro by T7 RNA polymerase . from a 
pBluescript II-SK(-) . plasmid template that had been 
linearized by digestion with endonuclease Xhol . The 
transcript was labeled with [cr 2 P]-CTP under similar 

15 reaction conditions as described for preparation of the T3 

RNA transcript. The 30-mer RNA was gel-purified and 
subsequently dephosphorylated as described. 

RESULTS 

Covalent Binding of Topoisomerase to a Duplex Substrate 
20 Containing RNA 3' of the Scissile Phosphate . 

Vaccinia topoisomerase does not bind covalently to CCCT7- 
containing RNA duplexes; nor does it form a covalen: 
complex on RNA-DNA hybrid duplexes in which one of the two 
strands is RNA (9) . Control experiments showed that the 
25 failure to form a covalent adduct on a CCCUU-containing RNA 

strand was not caused by uracil substitution for the 
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thymine bases in the CCCTT sequence (9) . To better 
understand why vaccinia topoisomerase does not form a 
covalent complex with all-RNA strands, we prepared 36-bp 
duplex substrates in which the scissile strand was a tandem 
RNA-DNA or DNA-RNA copolymer and the noncleaved strand was 
all-DNA (Fig. 1). • These duplexes were uniquely labeled 
with 3 "P at the scissile phosphodies ter . The substrate 
molecules were constructed by annealing two 18-mer 
oligonucleotides (one of which had been 5' -P-labeled) to 
a complementary 36-mer DNA strand to form a singly nicked 
duplex. The 5' -labeled 18-mer strand was then joined to 
the unlabeled CCCTT-strand (or CCCUU strand) in a reaction 
catalyzed by vaccinia virus DNA ligase. The 36-mer duplex 
products were isolated and then used as substrates for 
vaccinia DNA topoisomerase. We will refer to these 
substrates as DNA- p- DNA, DNA-p-RNA, and RNA-p-DNA, with the 
labeled phosphate being denoted by p. 

Transesterif ication by topoisomerase at the CCCTT site will 
result in covalent binding of a -3' 32 P-labeled 18-mer 
oligonucleotide to the enzyme. The extent of covalent 
complex formation on the DNA-p-RNA substrate in 10 min was 
proportional to input topoisomerase; 80-85% of the 36-mer 
strand was transferred to~ the topoisomerase at saturating 
enzyme (Fig. 1) . The same level of topoisomerase 

covalently bound less than 1% of the RNA-p-DNA 36-mer' 
strand. Hence, the topoisomerase tolerated RNA 

substitution downstream of the scissile phosphate, but was 
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impedecl from forming, the covalent adduct . when • the CCCTT 
sequence was in RNA form. 

The kinetics of the covalent binding reaction at a 
saturating level of topoisomerase were assessed (Fig. 2). 
5 An all-DNA 36-mer (DNA-p-DNA) was bound to an endpoint of 

21% in 2 min (Fig. 2A) . The apparent cleavage-rel igat ion 
equilibrium constant (K c - - covalent complex/noncovalent " 
complex) was 0.26, which agrees with values of 0.2 to 0.25 
reported previously for equilibrium cleavage of a 5' enc- 

10 labeled CCCTT-containing DNA substrate (10, 11) . The DNA- 

p-RNA 36-mer was bound covalently to an endpoint of 30% in 
5 min (Fig. 2A, and other data not shown) . The apparent 
equilibrium constant for DNA- p- RNA (K C1 = 4) was 
significantly higher than that observed for the all-DNA 

15 ligand. 

The RNA-p-DNA 36-mer was transferred to the topoisomerase, 
albeit very slowly. After 4 h, 43 of the CCCUU-containing 
RNA strand was bound covalently to the enzyme (Fig. 2B) . 
An endpoint was not established in this experiment. 

20 However, by comparing the initial rate of covalent adduct 

formation on RNA- p- DNA (0.04% of input substrate cleaved 
per min) to the amount adduct formed on DNA-p-DNA at the 
earliest timepoint (12?- in 10 sec), it is estimated that 
RNA substitution of the CCCTT-por t ion of the substrate 

25 slowed the rate of covalent complex formation by about 

three orders of magnitude. 
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DNA Strand Transfer to an RNA Acceptor . 

Rejoining of the cleaved strand occurs by attack of a 5' 
hydroxyl terminated polynucleotide on the 3' phosphodiester 
bond between Tyr-274 and the CCCTT site. This 
transesterif ication step can be studied independent of 
strand cleavage by assaying the ability of a performed 
topoisomerase-DNA complex to religate the covalently held 
strand to a heterologous acceptor strand (5, 11) . To form 
the covalent topoisomerase-DNA donor complex, the enzyme 
was initially incubated with a suicide substrate consisting 
of a 5' 3i P-labeled 18-mer scissile strand 

(CGTGTCG CCCTT ATTCCC) hybridized to a 30-mer strand. 
Cleavage of this DNA by topoisomerase is accompanied by 
dissociation of the 6-nucleotide leaving group, ATTCC . 
With no readily available acceptor for religation, the 
enzyme is essentially trapped on the DNA as a suicide 
intermediate (Fig. 3). In a 5 min reaction in enzyme 
excess, >90% of the 5' 32 P-labeled strand becomes covalently 
bound to protein. The strand transfer reaction was 
initiated by adding a 50-fold molar excess of an 18-mer 
acceptor strand (either DNA or RNA) complementary to the 5' 
single-strand tail of the covalent donor complex (Fig. 3), 
while simultaneously increasing the ionic strength to 0.3 
M NaCl. Addition of NaCl during the religation phase 
promotes dissociation of the topoisomerase after strand 
closure and prevents recleavage of the strand transfer 
product. Ligation of the covalently held 12-mer 
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CGTGTCGCCCTT to the 18-mer yields a 32 P-labeled 30 mer (Fig. 
4,, lane 1). The suicide intermediate transferred 94s of 
the input CCCTT-cont aining strand to the 18-mer DNA strand 
(Fig. 3). The extent of religation at the earliest time 
5 point (5 sec) was 90% of the endpoint value. From this 

datum a religation rate, constant ( k ro , ) of >0.5 sec ~ l was 
calculated. A k rel value of "1.3 sec'" had been determined 
previously (from experimental values for A' c: and i£ at 
37°C) (18) . 

10 Topoisomerase readily ligated the covaiently held 12-mer 

DNA to an 18-mer RNA acceptor to form a 30-mer product 
(Fig. 4, lane 5). 89% of the input CCCTT-strand was 
transferred to RNA, with 40% of the endpoint value attained 
in 5 sec. This datum was used to estimate a rate constant 

15 of 0.1 sec" 1 for single- turnover strand transfer to RNA. 

Thus, religation to DNA was about 10 times faster than 
religation to RNA. . The slowed rate of RNA religation is 
likely to account for the observed increase in the 
cleavage-religat ion equilibrium constant. (K az! '= A' cl /Jc rs: ) on 

20 the DNA- p- RNA 3 6-mer. 

Analysis of the Strand Transfer Reaction Product 

The predicted product of strand transfer to RNA is a 30-mer 
tandem DNA-RNA strand (5' - CGTGTCGCCCTTAUUCCGAUAGUGACUACA) 
uniquely 32 P-labeled at the 5' end. The structure of this 
25 molecule was confirmed by analysis of the susceptibility of 

this' product to treatment with NaOH. The labeled 30-mer 
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RNA ligation product was converted nearly quantitatively 
into a discrete species that migrated more rapidly than the 
input 13-mer CCCTT-containing DNA strand (Fig. 4 lane 6) . 
The mobility of this product was consistent with a chain 
5 length of 13 nucleotides. The expected 32 P-labeled alkaline 

hydrolysis product of the RNA strand transfer product is a 
13-mer (5' -CGTGTCGCCCTTAp) . Control reactions showed that 
neither the 3; P-labeled 18-mer scissile strand of the 
suicide substrate nor the 30-mer product of strand transfer 
10 to DNA was susceptible to alkali (Fig. 4, lanes 4 and 2) . 

It is concluded that topoisomerase can be used to iigate 
RNA to DNA in vitro. 

DNA ligand Tagging of an RNA Transcript Sy n thesized In 
Vitro by T3 RNA Polymerase . 

15 Practical applications of topoisomerase-mediated strand 

transfer to RNA include the 5' tagging of RNA transcripts. 
Bacteriophage RNA polymerases have been used widely to 
synthesize RNA polymerases have been used widely to 
synthesize RNA in vitro from plasmid DNA templates 

20 containing phase promoters. To test whether such 

transcripts were substrates for topoisomerase-catalyzed 
ligation, we constructed a CCCTT-containing suicide 
cleavage substrate that, when cleaved by topoisomerase, 
would contain a 5' single-strand tail complementary to the 

25 predicted 5' sequence of any ■ RNA transcribed by T3 RNA 

polymerase from a pBluescript vector (Fig. 5). A 36- 
nucleotide T3 transcript was synthesized in a transcription 
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reaction containing [a^P] GTP . The RNA was treated with 
alkaline phosphatase to dephosphoryla te the 5' terminus. 
The topoisomerase-DNA covalent intermediate was formed on. 
an unlabeled suicide substrate. Incubation : of the 

5 radiolabeled T3 transcript with the suicide intermediate 

resulted in the conversion of the 36-mer RNA into a novel 
species that migrated more slowly during polyacrylamide gel 
electrophoresis (not shown) . The apparent size of this 
product (48 nucleotides) was indicative of ligation to the 

10 12-mer CCCTT DNA strand. The kinetics of DNA ligation -o 

the T3 transcript are shown in Fig. 5. The reaction was 
virtually complete within 1 min; -at its endpoint 29% of the 
input RNA had been joined to DNA. No DNA- RNA ligation 
product was formed in reaction containing a T3 transcript 

15 that had not been treated with alkaline phosphatase (noc 

shown) . 

Formation of Insertions and Deletions - A Kinetic Analysis . 

The acceptor polynucleotides used in the experiments 
described above were capable of hybridizing perfectly with 

20 the 5' single-strand tail of the topoisomerase-DNA doner 

complex. It had been shown previously that the vaccinia 
virus topoisomerase is capable of joining the CCGTT-strar.d 
to an acceptor oligonucleotide that hybridizes so as to 
leave a single nucleotide gap' between the covalently bound 

25 donor 3' end and the 5' terminus of the acceptor.' 

Religation across this gap generated a 1. base deletion in 
the product compared to the input scissile . strand (5). The 
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enzyme also catalyzes strand transfer to an acceptor 
oligonucleotide that, when hybridized, introduces an extra 
nucleotide between the donor 3' end and the penultimate 
base-paired nucleotide of the acceptor. Religation in this 
5 case will produce a 1 base insertion (5) . Deletion and 

insertion formation in vitro have also been documented for 
mammalian type I topoisomerase (19). However, there has 
been no report of the effects of acceptor strand gaps and 
insertions on the rate of strand joining by these enzymes. 

10 The kinetics of strand transfer by the vaccinia 

topoisomerase covalent intermediate to acceptor 
oligonucleotides that base-pair to the donor complex no 
form either a fully base-paired 3' duplex segment, or 3' 
duplexes with a 1-nucleotide gap, or a 2-nucleotide gap, 

15 were assessed. 84% of the input DNA substrate was ligated 

to the fully-paired acceptor in 10 sec, the earliest time 
analyzed (Fig. 6A) . The size of the strand transfer 
product was 30 nucleotides, as expected (Fig. 7, lane 3; . 
No 30-mer product was formed in the absence of the added 

20 acceptor strand (Fig. 7, lane 2) . 

Religation across a 1-nucleotide gap was highly efficient, 
albeit slow. 85% of the input substrate was joined across 
a 1-nucleotide gap to yield the expected ' 29-nucleotide 
product (Fig. 6A and Fig. 7, lane 4) . The kinetic data in 
25 Fig. 6 fit well to a single exponential with an apparent 

rate constant of 0.005 sec* 1 . Thus, single-turnover strand 
closure by topoisomerase across a 1-nucieotide gap was two 
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orders of magnitude slower than the rate of joining across 
a fully paired nick. Vaccinia topoisomerase catalyzed 
strand transfer across a 2-nucleotide gap to form the 
anticipated 2 8-nucleot ide product (Fig. 7, lane 5) , buz 
5 this reaction was feeble (Fig. 6A) . Linear accumulation cf 

the 2-nucleotide gap product was observed over a 2 h 
incubation, at which time only 10% of the input DNA had 
been joined. It was estimated based on the initial rate 
..that religation across the 2-nucleotide gap was two orders 
10 of magnitude slower than joining across a 1-nucleotide gap 

(and hence four orders of magnitude slower than the rate cf 
joining across a nick) . 

Similar experiments were performed using DNA acceptors that: 
contained either 1 or 2 extra nucleotides at their 5' ends 

15 (Fig. 6C) . Religation to these acceptors yielded labeled 

strand transfer products' of 31 and 32 nucleotides, 
respectively (Fig. 7, lanes 6 and 7) . 90% of the input DNA 
was religated to form the 1-nucleotide insertion product: 
(Fig. 6C) . A rate constant of 0.04 sec~\ for religation 

20 with i-nucleotide insertion was calculated. A similar 

endpoint was achieved in the formation of a 2-nucleotide 
insertion product, but. the strand transfer rate was 
considerably slower (Fig. 6C) . The observed rate constant 
for 2-nucleotide insertion was 0.0001 sec" 1 , i.e., three 

25 orders of magnitude lower, than Jc rel at a nick. 

Ef'fect of 5' Acceptor Base Mismatch on Strand Transfer. 
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Strand transfer by topoisomerase to a set of 18-mer 
acceptors that were capable of base-pairing with the 5' 
tail of the donor complex from positions -2 to -IS 
(relative to the scissile +1 T : A base pair of the CCCTT 
5 element), but which have a base-mismatch at the -1 position 

immediately 3' of the scissile bond, was examined. The 
control acceptor, which has a normal -1 A: T base-pair, 
reacted to completion in 10- sec; 89% of the endpoint was 
achieved in 5 sec (Fig. 8). DNAs containing T:T, C:T, or 
] 0 G:T mispairs at the -1 position supported the same extent 

of strand transfer; 11% of the endpoint was attained in 5 
sec in each case (Fig. 8) . Thus, within the limits of 
detection of this experiment, mismatch at the -1. position 
had little effect on the strand transfer reaction. There 
15 are clear and instructive differences between the effects 

of base mismatches versus a single nucleotide deletion on 
the rate of the. strand joining step. 

Kinetics of Intramolecular Hairpin Formation . 

In the absence of an exogenous acceptor oligonucleotide, 
the 5' -OH terminus of the nonscissiie strand of the 12- 
mer/30-mer covalent complex can flip back and act as the 
nucleophile in attacking the DNA- ( 3-phosphotyrosyl ) bond 
(5) . The reaction product is a hairpin molecule containing 
a 12-bp stem and an 1 8-nucleot ide loop. The kinetics of 
this reaction were examined under single turnover 
conditions. In the experiment shown in Fig. 9A, 65% of the 
input CCCTT strand was converted to hairpin product in 3 h. 
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Th e observed rate constant was 5.7 x 10"* sec" 1 . In 
.parallel, the rate of hairpin formation by the covalent 
complex formed on an 18 -bp cleavage substrate (Fig. 9A) was 
analyzed. In this case, attack by the 5' -OH of the 
nonscissile strand yielded a hairpin molecule containing a 
12 -bp stem and a 6-nucleotide loop. 69% of the input CCCTT 
strand was" converted to hairpin product in 10 h. The 
observed rate constant was 8.2 x ltr : sec" : . Thus, the 13- 
nucleotide 5' tail was -7 times more effective than the 8- 
mer 5' tail as the attacking nucleophile for strand 
transfer in cis. Note that hairpin formation, by these 
covalent complexes occurs without any potential for base- 
pairing by the single-strand tails. 

In order to examine the contribution of base-pairing to the 
rate of religation, the 5' terminal and penultimate bases 
of bottom strand of the 18-mer/30-mer substrate to 5' -AT 

(Fig. 9B) were altered. Now, the 5' -terminal three bases 
of the bottom strand (5' -ATT) are identical to the 5' - 
terminal bases of the leaving strand (5' -ATTCCC) ; hence, 
the single-strand tail is self -complementary and capable of 
forming three base-pairs adjacent to the scissile 
phosphate. Intramolecular hairpin formation on this DNA 
was extremely fast; the. reaction was complete in 10-20 sec 

(Fig. 9B) . The observed religation rate constant was 0.2 
sec" 1 . By comparing this value to the religation rate 
constant on the non-complementary 1 8-mer / 30-mer substrate 

(Fig. 9A) , it was surmised that 3 base-pairs accelerated 
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the reaction -350-fold. 

Kinetics of Single-Turnover Cleavage of a CCCTT-contain : nq 
Hairpin Molecule 

The 42-nucleotide 5' .^P-labeled hairpin product was gel- 
5 purified and tested as a substrate for covalent adduct 

formation by the vaccinia topoisomerase . 55% of the input 
radioactivity was transferred to the topoisomerase 
polypeptide in 15 sec at 37°C; an endpoint of 90% transfer 
was attained in 60 sec (data not shown) . The apparent rate 
10 constant for cleavage of the hairpin was 0.06 sec"'. Thus, 

the topoisomerase rapidly and efficiently cleaved a CCCTT- 
containing molecule in which there were no standard paired 
bases downstream of the scissile phosphate. The hairpin 
cleavage rate constant is about one-fifth of k, z , on the 18- 
15 mer/30-mer suicide substrate, which contains five paired 

bases of duplex DNA 3' of the CCCTT site. 

DISCUSSION 

Vaccinia topoisomerase catalyzes a diverse repertoire of 
strand transfer reactions. Religation of the covalently 
bound DNA to a perfectly base-paired acceptor DNA 
oligonucleotide provides a model for the strand closure 
step of the DNA relaxation reaction. Here, the kinetics of 
strand transfer to alternative nucleic acid acceptors are 
analyzed. The findings provide new insights into the 
parameters that affect transesterif ication rate, illuminate 
the potential for topoisomerase to generate mutations in 
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vivo, and suggest practical applications of vaccinia 
topoisomerase as an RNA modifying enzyme. 

Sugar Specificity for Covalent Adduct Formation Resides 
within the CCCTT Element . 

5 Vaccinia topoisomerase is apparently incapable of binding 

covalently to CCCUU-containing RNA strands. This is the 
case whether the CCCUU strand is part of an RNA- RNA or an 
RNA-DNA duplex (9). It has now been shown that the sugar 
specificity of the enzyme is attributable to a stringent 

10 requirement for DNA on the 5' side of the scissile 

phosphate, i.e., the CCCTT site must be DNA. Moreover/ the 
CCCTT element must be a DNA- DNA duplex, because earlier 
experiments showed that a CCCTT strand is .not cleaved when 
annealed to a complementary RNA strand (9) . The RNA-DNA 

15 hybrid results are informative, because they suggest that 

the CCCTT site must adopt a B-form helical conformation in 
order to be cleaved. RNA and DNA polynucleotide chains 
adopt different conformations within an RNA-DNA hybrid, 
with the RNA strand retaining the A- form helical 

20 conformation (as found in dsRNA) while the DNA strand 

adopts a conformation that is neither strictly A nor B, but 
is instead intermediate in character between these two 
forms (20, 21.) . Vaccinia topoisomerase makes contacts with 
the nucleotide bases of the CCCTT site in the major groove 

25 (9, 22). It also makes contacts with specific phosphates 

of the CCCTT site (23). Adoption by the CCCTT site of a 
non-B conformation may weaken or preclude these contacts. 
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The finding that vaccinia topoisomerase is relatively 
insensitive to the nucleotide sugar composition downstream 
of the scissile phosphate implies that the conformation of 
the helix in this portion of the ligand is not important 
5 for site recognition or reaction chemistry. Topoisomerase 

cleaves DNA-p-RNA strands in which the leaving strand is 
RNA. Indeed, the extent of cleavage at equilibrium is 
significantly higher than that achieved on a DNA-p-DNA 
strand. 

10 Strand Transfer to RNA, . 

The increase in the cleavage-rel igat ion equilibrium 
constant K eq (= Jc, / Jc^ ) on the DNA-p-RNA substrate can be 
explained by the finding that the rate of single- turnover 
RNA religation * rei , R:;Ai is about one-tenth o& Lf3 k, 
15 Nonetheless, the extent of religation to RNA is quite high, 

i.e., -90% of the input CCCTT strand is religated to an 13- 
mer RNA acceptor strand in a 2 min reaction. It is shown 
that a CCCTT-containing DNA strand can be rapidly joined by 
topoisomerase to a transcript synthesized in vitro by 
20 bacteriophage RNA polymerase; -30% of the RNA is 

transferred to the DNA strand in a 2-5 min reaction. This 
property can be exploited to 5' tag any RNA for which the 
5' terminal RNA sequence is known, i.e./ by designing a 
suicide DNA cleavage substrate for vaccinia topoisomerase 
in which the nonscissile strand is complementary to the 5' 
sequence of the intended RNA acceptor. Some practical 
applications include: (i) -P-labeling of the 5' end of RNA 
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and (ii) affinity labeling the 5' end of RNA, 'e.g., by 
using a biotinylated topoisomerase cleavage substrate. A 
potential avantage of ■ topoisomerase-mediated RNA strand 
joining (compared with the standard T4 RNA ligase reaction) 
is that ligation by topoisomerase can be targeted by the 
investigator to RNAs of interest within a complex mixture 
of RNA molecules. 

Frame-Shift and Missense Mutagenesis 

It was reported earlier that vaccinia topoisomerase can- 
religate to complementary DNA acceptors containing recessed 
ends or extra nucleotides, thereby . generating the. 
equivalent of frame-shift mutations (5) . Similar reactions 
have been described by Henningfeld and Hecht (19) for the 
cellular type I topoisomerase. A key question is whether 
these aberrant religation reactions are robust enough to 
implicate topoisomerase as a potential mutagen in vivo. The 
kinetic analysis suggests that . they are and provides the 
first clue as to what spectrum of frame-shift reactions are 
most likely to occur (taking into account only the 
intrinsic properties of the topoisomerase). For the 
vaccinia enzyme, the hierarchy of frame-shift generating 
religation reactions is as follows: +1 insertion > -1 
deletion > +2 insertion >> -2 deletion. 

The slowest of these topoisomerase catalyzed reactions is 
strand closure across a 2-nucleotide gap (initial rate = 
0.002% of input DNA religated/sec) . In this situation, the 
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attacking nucleophile is held in place at some distance 
from the DNA-protein phosphodies ter by base-pairing to the 
nonscissile strand. Moving the 5' hydroxyl one base-pair 
closer to the phosphodies ter enhances reaction rate by a 
factor of 100. Extra on-paired nucleotides appear to pose 
much less of an impediment to strand joining to form 1- or 
2 nucleotide insertions. The active site of the 

topoisomerase may be able to accommodate extrahelical 
nucleotides; alternatively these nucleotides may 
intercalate into the DNA helix at the topcisomerase-induced 
nick. ' 



15 



There are two potential pathways for topoisomerase to form 
minus frame-shifts in vivo, which differ as to how the 
acceptor strand is generated: (i) the 5' end of the leaving 
strand can be trimmed by a nuclease, after which ligation 
could occur across the resulting gap; or (ii) a homologous 
DNA single strand attacks the covalent intermediate. the 
second pathway presumably requires a helicase in order to 
form the invading strand (and perhaps also to displace the 
20 leaving strand) : In the case of plus frame-shifts, only 

the latter pathway would be available to the topoisomerase, 
i.e., because no mechanism exists to add nucleotides to the 
5' terminus of the original leaving strand. No matter 
which pathway is taken, it is reasonable to assume that the 
most rapidly catalyzed mutagenic strand- joining reactions 
are the ones most likely to make their mark in vivo. If 
the religation reaction is slow, as for -2 frame-shifting, 
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then the cell . has greater opportunity to repair the 
mutagenic lesion, e.g., by removing the covalently bound 
topoisomerase. This could entail: (i) excision of a patch 
of the DMA strand to which the topoisomerase is bound; or 
5 (ii) hydrolysis of the topoisomerase-DNA adduct-. An enzyme 

that catalyzes the latter reaction was discovered recently 
by Yang et al . (24 ) . 

Introducing a base mismatch at the -1 position immediately 
flanking the scissile phosphate has almost no effect on the 

10 rate of religation.. This result is in stark contrast to 

the 10' : rate effect of a 1-nucleotide gap. It is inferred 
that the -1 base mismatches do not significantly alter the 
proximity of the 5' -hydroxyl nucleophile ■ of the terminal 
nucleotide to the scissile phosphate at. enzyme's active 

15 site. The results indicate clearly that topoisomerase has 

the capacity to generate missense mutations in vitro. The 
single-strand invasion pathway involved above for frame- 
shift mutagenesis could, -in principle, provide the 
opportunity for topoisomerase to create missense mutations 

20 in vivo. The kinetics of ligation in vitro suggest that 

topoisomerase-generated missense mutations would 
predominate over frame-shifts. 

The Kinetic Contribution of Base Complementarity ' 

Kinetic analysis of intramolecular hairpin formation by the 
25 vaccinia topoisomerase provides the first quantitative 

assessment of the role of base complementarity in strand 
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closure. The rate constant for attack on the DNA- (3' - 
phosphotyrosyl ) bond by a non-pairing 1 3-nucleotide single 
strand linked in cis to the covalent complex was 5.7 x 10" ; 
sec " : . Altering only the terminal bases of the single- 
5 strand tail to allow base-pairing at the -1, -2, and -3 

positions increased the rate constant for hairpin formation 
by 350-fold. The rate of religation in cis with 3 
potential base-pairs was nearly the same as the rate of 
religation to a non-covalently linked acceptor strand that 

10 forms 18 base pairs 3' of the scissile bond. The ability 

of the covalently bound enzyme to take up and rapidly 
rejoin DNA strands with only three complementary 
nucleotides lends credence to the suggestion that vaccinia 
topoisomer'ase catalyzes the formation of recombination 

15 intermediates in vivo (25), either via strand invasion or 

by reciprocal strand transfer between two topoisomerase-DNA 
. complexes . 

Generation of Gene Sequences 
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The use of a DNA-tagged RNA to clone gene sequences was 
evaluated using 96 base test RNA fragment of known sequence 
(GGG AGA CCC AAG CTC GCC CGG TTC TTT TTG TCA AGA CCG ACC 
TGT CCG GTG CCC TGA ATG AAC TGC AGG ACG AGG CAG CGC GGC TAT 
CGT GGC TGG) . This test RNA was synthesized using a T7 
Invitrotranscription kit from Ambion Co. using protocols 
25 supplied by the manufacturer. 

A topoisomerase-DNA intermediate was generated as follows: 
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25 pi of streptaviciin conjugated Dynabeads (Dynal) were 
washed twice with 25 1 of 2X B&W buffer (10 mM Tris pH 7.5, 
1. mM EDTA, 2 M NaCl) in an eppendorf tube then resuspended 
in 50 ul IX B&W buffer. 1.5 pg of a biotinylated oligo 
5 (TOPOB1) and .75 pg of two annealing oligos (TOPOP2, 

TOPOP3) were added tc the beads and heated to 70° C for 5 
minute's, then cooled on ice for 2 minutes. The beads were 
then washed twice with 25 pi each of NEB #1 buffer (New 
England Bioiabs - iOmM Bis Tris Propane-HCl, lOmM 
10 MgC12 / imMDTT pH7 . 0 @ 25°) to remove any unannealed 

oligonucleotides. The oligonucleotides were synthesized by 
Dalton Biochemicals (Canada) and had the following 
sequences : 

TOPOB1 - 5' B-GTTTTGGCTCCCATATACGACTCGCCCTTNTTCCGATAGTG 
15 TOPOP2 - 5' -NAAGGGCGAGTC 

TOPOP3 - 5' -CACTATCGGAA. 

The 5' end of TOPOB1 was biotinylated by using a 
biotinylated guanine nucleotide during that round of 
automated synthesis. 

After the annealing step, the DNA substrate was modified 
using vaccinia . topoisomerase basically as previously 
described. .Approximately 2.5 g of vaccinia Topoisomerase 
1 was added to the beads in 25 pi of ! IX NEB #1 buffer. 
This mixture was placed on a rotating wheel for 5 minutes 
at roorr. temperature then washed three times with 25 pi of 
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IX NEB #1 buffer. Approximately 100-200ng of the 96mer RNA 
was added to the washed topoisomerase-DNA intermediate 
bound beads in 10 pi, then 15 pi of 0.5 M NaCl (final cone. 
0.3 M) was "added, and the tube was rotated for 5 minutes at 
5 room temperature. 

The DNA-tagged RNA bound beads were next washed twice with 
IX RT buffer (cDNA Cycle Kit, Invitrogen, Carlsbad, CA, 
cat. # L1310-01), primed with RT96 (synthesis of firs: 
strand) and PCR performed using the cDNA Cycle Kit 
10 according to the manufacturer's instructions and primers 

P.CR96 and PCR53 . 

RT96 - 5' -CCACGATAGCCGCGCT 

PCR9 6 - CGTCCTGCAGTTCATTCAG 

PCR53 - GGCTCCCATATACGACTC 

15 The reaction cycles were as .follows: 2 minutes at 94 C C, 

then 25 - 35 cycles (10 sec/cycle) 94°C, 55°C and 72°C, 
followed by 5 minutes at 72°C. The resulting amplified cDNA 
was inserted into a plasmid vector using a TOPO™TA cloning 
Kit (Invitrogen, Carlsbad, CA, cat. #K4500-01) used 

20 according to the manufacturer's instructions. 

While the foregoing has been presented with reference :o 
particular embodiments of the invention, it will be 
appreciated by those skilled in the art that changes in 
these embodiments may be made without departing from the 
25 principles and spirit of the invention, the scope of which 
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is defined by the appended claims. 
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What is claimed is 

1. A method of covaiently joining a DNA strand to an RNA 
strand comprising : 

(a) forming a topoisomerase-DNA intermediate by 
incubating a DNA cleavage substrate comprising a 
topoisomerase cleavage site with a topoisomerase 
specific for that site, wherein the 
topoisomerase-DNA intermediate has one or more 5 ' 
single-strand tails; and 

(b) adding to the topoisomerase-DNA intermediate an 
acceptor RNA strand complementary to the 5' 
single-strand tail under conditions permitting a 
ligation of the covaiently bound DNA strand of 
the topoisomerase-DNA intermediate to the RNA 
acceptor strand and dissociation of the 
topoisomerase, thereby covaiently joining the DNA 
strand to the RNA strand. 

2. A method of claim 1, wherein the DNA cleavage 
substrate is created by hybridizing a DNA strand 
having a topoisomerase cleavage site to a 
complementary DNA strand, thereby forming a DNA 
cleavage substrate having a topoisomerase cleavage 
site and a oligonucleotide leaving group located 3' of 
a scissile bond. 

DNA cleavage 
comprising a 

4. The method of claim 1, wherein the topoisomerase 



10 



15 



A method of claim 1, wherein the 
substrate is a plasmid vector 
topoisomerase cleavage site. 
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cleavage site is a sequence comprising CCCTT . 

5. The method of claim 1, wherein the topoisomerase is a 
vaccinia topoisomerase enzyme. 

6. The method of claim 1, wherein the DNA strand 
5 comprising a topoisomerase cleavage site is 

radiolabelled . 

I, The method of claim 6, wherein the radiolabel is :i,? P or 
a radiohalogen . 

8. The method of claim 1, wherein the DNA strand having 
10 a topoisomerase clea- ..ge site is labeled with a biotin 

mo i e t y . 

9. The method of claim 1, wherein the topoisomerase-bound 
DNA intermediate and the acceptor RNA strand are ^ 
ligated _in vitro . 

15 10. A topoisomerase -DNA intermediate molecule comprising 

one or more 5' s ingle- s t rand tails. 

II. The topoisomerase-DNA intermediate molecule of claim 
1C, wherein the 5' single-strand tail comprises a 
speci f ic sequence . 

20 12. A topoisomerase-DNA intermediate molecule comprising 

a 5' single-strand tail generated by seep "(a) of the 
method of claim 1 . 



A topoisomerase-DNA intermediate molecule comprising 
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a 5' single-strand tail generated by steps (a) of the 
method of claim 1, wherein the 5' single-strand tail 
comprises a specific sequence. 

A topoisomerase-DNA intermediate molecule comprising 
a 5' single-strand tail generated by steps (a) of the 
method of claim 1, wherein the DNA strand is 
radiolabelled. 

The topoisomerase-DNA intermediate molecule of claim 
13, wherein the radiolabel is 3:; P or a radiohalogen. 

A topoisomerase-DNA intermediate molecule comprising 
a 5' single-strand tail generated by steps (a) of the 
method of claim 1, wherein the DNA strand is affinity 
labeled . 

The topoisomerase-DNA intermediate molecule of claim 
16, wherein the affinity label is a biotin moiety, a 
chitin binding domain or a glutathione-S- trans f erase 
mo i e t y . 

A DNA-RNA molecule covalently joined by topoisomerase 
catalysis. 

A DNA-RNA molecule covalently joined by the method of 
claim 1 . 



The covalently joined DNA-RNA molecule of claim 19, 
having a 5' end label. 



The covalently joined DNA-RNA molecule of claim 20, 
wherein the 5' end label is 32 P or a radiohalogen. 
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22. The covalently joined DNA-RNA molecule of claim 20, 
wherein the 5' end label is a biotin moiety, a chitin 
binding domain, or a glutathione-S- trans f erase moiety. 

23. A covalently joined DNA-RNA molecule having a labeled 
5 5' end. 

24. The covalently .joined DNA-RNA molecule of claim 23, 
wherein the 5' end label is 3;: P or a radiohalogen . 

25. The covalently joined DNA-RNA molecule of claim 23, 
wherein the 5' end label is a biotin moiety, a chitin 

10 binding domain, or a glutathione-S- transferase moiety. 

26. A method of tagging a 5' end of an RNA molecule 
comprising: 

(a) forming a topoisomerase-DNA intermediate by 
incubating a DNA cleavage substrate comprising a 

15 topoisomerase cleavage site with a topoisomerase 

specific for that site, wherein the 
topoisomerase-DNA intermediate has one or more 5' 
single-strand tails; and 

(b) adding to the topoisomerase-DNA intermediate a 
20 S'-hydroxyl terminated RNA molecule complementary 

to the 5' single-strand tail under conditions 
permitting a ligation of the covalently bound DNA 
strand of the topoisomerase-DNA intermediate to 
the RNA molecule and dissociation of the 
25 topoisomerase, thereby forming a 5' end tagged 

DNA-RNA ligation product. 

27. A method of claim 26, wherein the 5' -hydroxy! 
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terminated RNA molecule is the product of in vitro 
synthesis or isolation from cells or tissues. 

The method of claim 27, wherein the RNA molecule is 
dephosphorylated after synthesis or isolation. 



5 29. The method of claim 28, wherein the dephosphory la t ion 

is achieved by treatment of the RNA molecule with 
alkaline phosphatase. 

30. A method of claim 26, wherein the DNA cleavage 
substrate is created by hybridizing a DNA strand 
10 having a topoisomerase cleavage site tc a 

complementary DNA strand, thereby forming a DNA. 
cleavage substrate having a topoisomerase cleavage 
site and a oligonucleotide leaving group located 3' of 
a scissile bond. 

15 31- The method of claim 26, wherein the topoisomerase is 

a vaccinia topoisomerase enzyme. 

32. The method of claim 26, wherein the cleavage sire 
comprises CCCTT. 

33. The method of claim 26, wherein the DNA comprises a 5' 
20 end label . 

34. The method of claim 33, wherein the 5' end label is a 
biotin moiety, .a chitin binding domain, or a 
glutathione-S-transf erase moiety . 

35. The method of claim 33, further comprising 
25 immobilizing the 5' end labeled DNA on a solid support 

prior to the addition of the 5'-hydroxyl terminated 
RNA molecule . 



36 



The method of claim 35, wherein the solid support 
comprises streptavidin, avidin, chitin or glutathione. 
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37. The method of claim. 35, further comprising, purifying 
a biotinylated 5' end tagged DNA-RNA ligation product 
by separating the solid support to which the 5' end 
labeled DNA-RNA ligation product is immobilized from 

5 a liquid phase comprising unmodified RNA* 

38. A 5' end tagged RNA molecule. 

39. The 5' end tagged RNA molecule of claim 38, wherein 
the tag is a DNA sequence. 

40. The 5' end tagged RNA molecule of claim 39, further 
10 comprising a 5' end label. 

41. The 5' end tagged RNA molecule of claim 41, wherein 
the label is y ' [ P or a radiohalogen . 

42. The 5' end tagged RNA molecule of claim 43, wherein 
the label is a biotin moiety, a chitin binding domain, 

15 or a glutathione-S-transf erase moiety. 

43. A 5' end tagged RNA molecule generated by the method 
of claim 26. 

44. A DNA-RNA molecule which" has been joined in vitro by 
the use of a topoisomerase . 

20 45. A method of obtaining full-length gene sequences 

comprising : 

(a) isolating full-length mRNA; 

(b) attaching a DNA tag sequence to the isolated 
mRNA; and 

25 (c) synthesizing cDNA using the tagged mRNA as a 

template. 

46. A method of claim 45, wherein the mRNA is isolated by 
employing an affinity purification material. 

47. A method of claim 46, wherein the mRNA to be isolated 
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comprises an affinity purification tagged cap 
structure . 

48. A method of claim 46, wherein the affinity 
purification tag is a biotin moiety, a chitin binding 
domain or a glutathione-S- trans f erase moiety. 

49. A method of claim 46, wherein the affiniry 

r 

purification material comprises a solid support 
complexed with phenylboronic acid, streptavidin, 
avidi'h, chitin or glutathione. 

50. A method of claim 49, wherein the solid support is 
magnetic beads cr sepharose. 

51. A method of claim 45 wherein the mRNA is isolated from 
plant cells or animal cells. 

52. A method of claim 51 wherein the animal cells are 
.15 mammalian cells' or insect cells. 

53. A method of claim 45, wherein the mRNA is decapped and 
dephosphorylated after isolation. 

54. A method of claim 53 wherein the mRNA, is decapped 
enzymatically or by chemical treatment. 

20 55. A method of claim 54 wherein the enzyme is a 

pyrophosphatase. 

56. A method of claim. 54 wherein the chemical treatment is 
periodate oxidation and beta elimination. 

57. A method of claim 53 wherein the mRNA is 
25 dephosphorylated using alkaline phosphatase. 

58. A method of claim 45, wherein the DNA tag sequence 
comprises a recognition site for a type I 
topoisomerase . 
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59. A method of claim 58 wherein the DNA tag sequence 
further comprises a recognition site for a site- 
specific restriction endonuclease . 

60. A method of claim 58 wherein the type I topoisomerase 
5 is vaccinia DNA topoisomerase. 

61. A method of claim 58 wherein the DNA tag' sequence 
comprises the double stranded sequence shown in Figure 
11 wherein N represents an adenosine moiety, a 
guanosine moiety, a cytosine moiety or a thymidine 

10 moiety. 

62. A method of claim 61 wherein M is 1 to 4 nucleotide 
bases . 

63. A method of claim 61 wherein vaccinia DNA 
topoisomerase is covalently bound to the double 

15 stranded tag sequence. 

64 . A method of claim 45 further comprising amplifying the 
synthesized cDNA wherein the amplification primers 
comprise an anti-coding sequence of the tag sequence 
(5' ) and a gene specific sequence (3' ) , 

20 65. A method of claim 64 further comprising inserting the 

amplified cDNA into an expression vector. 

66. A method of claim 45 wherein the DNA tag sequence is 
a linearized expression vector . 

67. At. isolated full-length gene sequence prepared by the 
25 method of claim 45. 

68. A nucleic acid construct comprising an isolated full- 
length gene sequence prepared of the method of claim 
4 5 and an expression vector. 

69. A nucleic acid construct of claim 68 wherein the 
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expression vector comprises one or more elemencs 
selected from: a promoter-enhancer sequence, a 
selection marker sequence, an origin of replication, 
an epitope-tag encoding sequence or an affinity 
purification-tag encoding sequence . 

70. A nucleic acid construct of claim 69 wherein the 
promoter-enhancer sequence is the T7 promoter, gall 
promoter, metallothionein promoter, AraC promoter, or 
CMV promoter-enhancer. 

71. A nucleic acid construct of claim 69 wherein the 
selection marker sequence encodes an antibiotic 
resistance gene. 

72. A nucleic acid construct of claim 69 wherein the 
epitope-tag sequence encodes V5, the peptide Phe-His- 
His-Thr-Thr, hemaglut inm, or glutathione-S- 
transf erase . 

73. A nucleic acid construct of claim 69 wherein the 
affinity purification-tag sequence encodes a polyamino 
acid sequence or a polypeptide. 

74. A nucleic acid construct of claim 73 wherein said 
polyamino acid sequence is polyhist idine . 

75. A nucleic acid construct of claim 73 wherein said 
polypeptide is chitin binding domain or glutathione-S- 
transf erase . 

76. A nucleic acid construct of claim 73 wherein said 
polypeptide encoding sequence includes an intein 
encoding sequence . 

77. A nucleic acid construct of claim 68 wherein the 
expression vector is a eukaryotic expression vector or 
a prokaryotic expression vector. 
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78. A nucleic acid construct of claim 77 wherein the 
eukaryotic expression vector is pYES2, pMT , pIND, or 
pcDNA3.1. 

79. A method of obtaining full-length gene sequences 
5 comprising: 

(a) isolating full-length mRNA by employing an 
affinity purification material; 

(b) decapping and dephosphorylat ing the isolated 
mRNA; 

10 (c) attaching a DNA tag sequence to the decapped, 

dephosphorylated mRNA, wherein the tag sequence- 
comprises the sequence shown in Figure 11 and is 
attached by vaccinia DNA topoi somerase ; 

(d) synthesizing cDNA using the tagged mRNA as a 
15 - template; 

(e) amplifying the synthesized cDNA, wherein the 
amplification primers comprise an anti-coding 
sequence of the tag sequence (5') and a gene 
specific sequence (3'); and 

20 (f) inserting the amplified cDNA into an expression 

vector. 
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